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(57) Abstract: The present invention provides human, rat and mouse NPCfLl polypeptides and polynucleotides encoding the 
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included is a method of identifying ligands which bind to NPCILI using membranes derived from brush border membrane prepara- 
tions. Compounds that bind to NPCILI can be used for inhibiting intestinal cholesterol absorption in a subject. 
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NPC1L1 (NPC3) AND METHODS OF IDENTIFYING 

LIGANDS THEREOF 

SPECIFICATION 

5 The invention claimed herein was made on behalf of Merck & Co., 

Inc. and Schering-Plough Corporation, parties to a joint research agreement that was 
in effect on or before the date the claimed invention was made. 

This application claims priority to Serial No. 60/537,341, filed 

January 16, 2004. 

10 FIELD OF THE INVENTION 

The present invention includes NPC1L1 polypeptides and 
polynucleotides which encode the polypeptides, methods of use and methods of 
identifying modulators and ligands thereof. 

BACKGROUND OF THE INVENTION 
15 a factor leading to development of vascular disease, a leading cause of 

i 

death in industrialized nations, is elevated serum cholesterol. It is estimated that 19% 
of Americans between the ages of 20 and 74 years of age have high serum cholesterol. 
The most prevalent form of vascular disease is arteriosclerosis, a condition associated 
with the thickening and hardening of the arterial wall. Arteriosclerosis of the large 

20 vessels is referred to as atherosclerosis. Atherosclerosis is the predominant 
underlying factor in vascular disorders such as coronary artery disease, aortic 
aneurysm, arterial disease of the lower extremities and cerebrovascular disease. 

Cholesteryl esters are a major component of atherosclerotic lesions and 
the major storage form of cholesterol in arterial wall cells. Formation of cholesteryl 

25 esters is also a step in the intestinal absorption of dietary cholesterol. Thus, inhibition 
of cholesteryl ester formation and reduction of serum cholesterol can inhibit the 
progression of atherosclerotic lesion formation, decrease the accumulation of 
cholesteryl esters in the arterial wall, and block the intestinal absorption of dietary 
cholesterol. 
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The regulation of whole-body cholesterol homeostasis in mammals and 
animals involves the regulation of intestinal cholesterol absorption, cellular 
cholesterol trafficking, dietary cholesterol and modulation of cholesterol biosynthesis, 
bile acid biosynthesis, steroid biosynthesis and the catabolism of the cholesterol r 
5 containing plasma lipoproteins. Regulation of intestinal cholesterol absorption has 
proven to be an effective means by which to regulate serum cholesterol levels. For 
example, a cholesterol absorption inhibitor, ezetimibe ( 




), has been shown to be effective in this regard. A pharmaceutical composition 

1 0 containing ezetimibe is commercially available from Merck/Schering-Plough 

Pharmaceuticals, Inc. under the trade name Zetia®. Identification of a gene target 
through which ezetimibe acts is important to understanding the process of cholesterol 
absorption and to the development of other, novel absorption inhibitors. The present 
invention addresses this need by providing a rat and a mouse homologue of human 

15 NPC1L1 (also known as NPC3; Genbank Accession No. AF192522; Davies, et al. 9 
(2000) Genomics 65(2): 137-45 and Ioannou, (2000) Mol. Genet. Metab.71(l-2): 
175-81), an ezetimibe target 

NPC1L1 is an N-glycosylated protein comprising a YQRL (SEQ ID 
NO: 38) motif (i.e., a /rans-golgi network to plasma membrane transport signal; see 

20 Bos, et at., (1993) EMBO J. 12: 2219-2228; Humphrey, et al. y (1993) J. Cell. Biol. 
120: 1123-1135; Ponnambalam, et aL, (1994) J. Cell. Biol. 125: 253-268 and 
Rothman, et al 9 (1996) Science 272: 227-234) which exhibits limited tissue 
distribution and gastrointestinal abundance. Also, the human NPC1L1 promoter 
includes a Sterol Regulated Element Binding Protein 1 (SREBP1) binding consensus 

25 sequence (Athanikar, et a/., (1998) Proc. Natl. Acad. Sci. USA 95: 4935-4940; 
Ericsson, et al 9 (1996) Proc. Natl. Acad. Sci. USA 93: 945-950; Metherall, et aL 9 
(1989) J. Biol. Chem. 264: 15634-15641; Smith, et ah, (1990) J. Biol. Chem. 265: 
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2306-2310; Bennett, et al. 9 (1999) J. Biol. Chem. 274: 13025-13032 and Brown, et 
al 9 (1997) Cell 89: 331-340). NPC1L1 has 42% amino acid sequence homology to 
human NPCl (Genbank Accession No. AF002O20), a receptor responsible for 
Niemann-Pick CI disease (Carstea, et aL 9 (1997) Science 277: 228-231). Niemann- 
5 Pick CI disease is a rare genetic disorder in humans which results in accumulation of 
low density lipoprotein (LDL)-derived unesterified cholesterol in lysosomes 
(Pentchev, et al. 9 (1994) Biochim. Biophys. Acta. 1225: 235-243 and Vanier, et aL y 
(1991) Biochim. Biophys. Acta. 1096: 328-337). In addition, cholesterol accumulates 
in the rrans-golgi network of npcl" cells, and relocation of cholesterol, to and from the 

10 plasma membrane, is delayed. NPC1 and NPC1L1 each possess 13 transmembrane 
spanning segments as well as a sterol-sensing domain (SSD). Several other proteins, 
including HMG-CoA Reductase (HMG-R), Patched (PTC) and Sterol Regulatory 
Element Binding Protein Cleavage-Activation Protein (SCAP), include an SSD which 
is involved in sensing cholesterol levels possibly by a mechanism which involves 

15 direct cholesterol binding (Gil, et aL, (1985) Cell 41 : 249-258; Kumagai, et aL, 

(1995) J. Biol. Chem. 270: 19107-191 13; Hua, et aL, (1996) Cell 87: 415-426; and 
Radhakrishnan, A., et aL, "Direct binding of cholesterol to the purified membrane 
region of SCAP: Mechanism for a sterol-sensing domain," MoL Cell 15, 259-268 
(2004)). 

20 SUMMARY OF THE INVENTION 

The present invention is based on the discovery that NPC1 LI is the 
target through which ezetimibe acts, and consequently plays a critical role in the 

■ 

regulation of sterol and 5a-stanol intestinal transport and absorption, e.g. cholesterol 
absorption. Accordingly, this invention provides for the use of NPC1L1 in an assay 

25 for identifying ligands that block NPClLl-mediated sterol and 5cc-stanol intestinal 
transport The present invention provides methods for identifying ligands of NPC1L1 
which involve contacting NPC1 LI with a detectably labeled substituted 
2-azetidinone, preferably substituted 2-azetidinone-glucuronide, and a candidate 
compound, and determining whether the candidate compound binds to NPC1L1 . The 

3 0 modulation of the binding of the substituted 2-azetidinone to NPC 1 LI by the binding 
of the candidate compound to NPC1L1 indicates that the candidate compound is a 
ligand that binds to NPC1L1 and is an inhibitor of sterol and 5ct-stanol absorption. 
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The present invention also provides a method for identifying a ligand 
of NPC1L1 comprising contacting NPC1L1 with a detectably labeled substituted 2- 
azetidinone, preferably substituted 2-azetidinone-glucuronide, and measuring the 
binding of detectably labeled substituted 2-azetidinone to NPC1L1 in the presence 
5 and absence of a candidate compound, wherein decreased binding of the detectably 
labeled substituted 2-azetidinone to the NPC1L1 in the presence of the candidate 
compound indicates that said candidate compound is a ligand of NPC1L1 and is an 
inhibitor of sterol and 5a-stanol absorption. 

The present invention also provides for a method for identifying a 

10 compound that inhibits intestinal sterol or 5a-stanol absorption mediated by NPC1 LI 
involving contacting NPC1L1 with a detectably labeled ligand and the candidate 
compound and determining whether the candidate compound binds to NPC1L1, 
wherein binding of said candidate compound to NPC1L1 modulates binding of said 
ligand to NPC1L1, wherein said modulation indicates that the candidate compound is 

15 an intestinal sterol or 5a-stanol absorption inhibitor. 

The present invention provides methods for identifying an ligand of 
NPC1L1 comprising (a) contacting a host cell (e.g., human embryonic kidney (HEK) 
293 cells, Chinese hamster ovary (CHO) cell, a J774 cell, a macrophage cell or a 
Caco2 cell) expressing a polypeptide comprising the amino acid sequence of SEQ ID 

20 NO: 2 or SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment thereof on a cell 
surface, in the presence of a known amount of a detectably labeled (e.g., with H, C, 
,25 1, 35 S or fluorescence labeling) substituted azetidinone (e.g., ezetimibe), with a 
sample to be tested for the presence of an NPC1L1 ligand; and (b) measuring the 
amount of detectably labeled substituted azetidinone (e.g., ezetimibe) specifically 

25 bound to the polypeptide; wherein an NPC1L1 ligand in the sample is identified by 
measuring substantially reduced binding of the detectably labeled substituted 
azetidinone (e.g., ezetimibe) to the polypeptide, compared to what would be measured 
in the absence of such a ligand. 

Another method for identifying an ligand of NPC1L1 is also provided. 

30 The method comprises (a) placing, in an aqueous suspension, a plurality of support 
particles, impregnated with a fluorescer (e.g., yttrium silicate, yttrium oxide, 
diphenyloxazole and polyvinyltoluene), to which a host cell (e.g, Chinese hamster 
ovary (CHO) cell, a J774 cell, a macrophage cell or a Caco2 cell) expressing a 
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polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4 
or SEQ ID NO: 12 or a functional fragment thereof on a cell surface are attached; (b) 
adding, to the suspension, a radiolabeled (e.g., with 3 H, 14 C or !25 I) substituted 
azetidinone (e.g., ezetimibe) and a sample to be tested for the presence of a ligand, 
S wherein the radiolabel emits radiation energy capable of activating the fluorescer 
upon the binding of the substituted azetidinone (e.g., ezetimibe) to the polypeptide to 
produce ligjit energy, whereas radiolabeled substituted azetidinone (e.g., ezetimibe) 
that does not bind to the polypeptide is, generally, too far removed from the support 
particles to enable the radioactive energy to activate the fluorescer; and (c) measuring 
10 the light energy emitted by the fluorescer in the suspension; wherein an NPC1L1 

* 

ligand in the sample is identified by measuring substantially reduced light energy 
emission, compared to what would be measured in the absence of such a ligand. 

Also provided is a method for identifying a ligand of NPC1L1 
comprising (a) contacting a host cell (e.g., Chinese hamster ovary (CHO) cell, a J774 

15 cell, a macrophage cell or a Caco2 cell) expressing a polypeptide comprising an 
amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or SEQ ID NO: 12 or a 
functional fragment thereof on a cell surface with detectably labeled (e.g., with *H, 

C or I) sterol (e.g., cholesterol) or 5<x-stanol and with a sample to be tested for the 
presence of an ligand; and (b) measuring the amount of detectably labeled sterol (e.g., 

20 cholesterol) or 5a-stanol in the cell; wherein an NPC1L1 antagonist in the sample is 
identified by measuring substantially reduced detectably labeled sterol (e.g., 
cholesterol) or 5ct-stanol within the host cell, compared to what would be measured in 
the absence of such an antagonist and wherein an NPC1L1 agonist in the sample is 
identified by measuring substantially increased detectably labeled sterol (e.g., 

25 cholesterol) or 5<x-stanol within the host cell, compared to what would be measured in 
the absence of such an agonist 

The present invention includes methods for inhibiting NPC1L1- 
mediated intestinal sterol (e.g., cholesterol) or 5a-stanol uptake, in a subject, by 
administering a substance identified by the screening methods described herein to the 

30 subject. Such substances include compounds such as small molecule antagonists of 
NPC1L1 other than ezetimibe. Also contemplated are methods for antagonizing 
NPC1 LI -mediated sterol (e.g., cholesterol) or 5a-stanol absorption by administering 
anti-NPClLl antibodies. NPC1 LI -mediated absorption of sterol (e.g., cholesterol) or 
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Sa-stanol can also be antagonized by any method which reduces expression of 
NPC1L1 in an organism. For example, NPC1L1 expression can be reduced by 
introduction of anti-sense NPC1L1 mRNA into a cell of an organism or by genetic 
mutation of the NPC1L1 gene in an organism {e.g., by complete knockout, disruption, 
5 truncation or by introduction of one or more point mutations). 

Also included in the present invention is a mutant transgenic mammal 
(e.g, mouse, rat, dog, rabbit, pig, guinea pig, cat, horse), preferably a mouse 
comprising a homozygous or heterozygous mutation (e.g, disruption, truncation, one 
or more point mutations, knock out) of endogenous, chromosomal NPC1L1 wherein, 

10 preferably, the mouse does not produce any functional NPC1L1 protein. Preferably, 
the mutant mouse, lacking functional NPC1L1, exhibits a reduced level of intestinal 
sterol (eg. 9 cholesterol) or 5a-stanol absorption and/or a reduced level of serum sterol 
(e.g, cholesterol) or 5a-stanol and/or a reduced level of liver sterol (e.g., cholesterol) 
or 5a-stanol as compared to that of a non-mutant mouse comprising functional 

1 5 NPC1L1 . Preferably, in the mutant mouse chromosome, the region ofNPCJLl (SEQ 
ID NO: 45) deleted is from nucleotide 790 to nucleotide 998. In one embodiment, 
NPC1L1 (SEQ ID NO: 1 1) is deleted from nucleotide 767 to nucleotide 975. Any 
offspring or progeny of a parent NPC1L1 mutant mouse (i.e., npclll) of the invention 
which has inherited an npclll mutant allele is also part of the present invention. 

20 The scope of the present invention also includes a method for 

screening a sample for an intestinal sterol (e.g., cholesterol) or 5<x-stanol absorption 
antagonist comprising (a) feeding a sterol (e.g., cholesterol) or 5a-stanol-containing 
substance (e.g., comprising radiolabeled cholesterol, such as A4 C-cIiolesterol or 3 H- 
cholesterol) to a first and second mouse comprising a functional NJPC1L1 gene and to 

25 a third, mutant mouse lacking a functional NPC1L1; (b) administering the sample to 
the first mouse comprising a functional NPC1L1 but not to the second mouse; (c) 
measuring the amount of sterol (e.g., cholesterol) or 5<x-stanol absorption in the 
intestine of said first, second and third mouse (e.g., by measuring serum cholesterol); 
and (d) comparing the levels of intestinal sterol (e.g., cholesterol) or 5a-stanol 

30 absorption in each mouse; wherein the sample is determined to contain the intestinal 
sterol (e.g., cholesterol) or 5a-stanol absorption antagonist when the level of intestinal 
sterol (e.g, cholesterol) or 5ct-stanol absorption in the first mouse and third mouse are 
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less than the amount of intestinal sterol (e.g., cholesterol) or Sa-stanol absorption, in 

the second mouse. 

The present invention also encompasses a kit comprising (a) a 
substituted azetidinone (e.g. 9 ezetimibe) in a pharmaceutical dosage form (e.g., a pill 

5 or tablet comprising 10 mg substituted azetidinone (e.g., ezetimibe)); and (b) 

information, for example in the form o f an insert, indicating that NPC1L1 is a target 
of ezetimibe. The kit may also include simvastatin in a pharmaceutical dosage form 
(e.g., a pill or tablet comprising 5 mg, 1 0 mg, 20 mg, 40 mg or 80 mg simvastatin). 
The simvastatin in pharmaceutical dosage form and the ezetimibe in pharmaceutical 

1 0 dosage form can be associated in a single pill or tablet or in separate pills or tablets. 

The present invention also provides any isolated mammalian cell (*?.g., 
isolated mouse cell, isolated rat cell or isolated human cell) which lacks a gene wliich 
encodes or can produce a functional NPC1L1 polypeptide. The isolated cell can toe 
isolated from a mutant mouse comprising a homozygous mutation of endogenous, 

1 5 chromosomal NPC1L1 wherein the mouse does not produce any functional NPC1L1 
protein. Further, the mutation can be in a gene which when un-mutated encodes an 
amino acid sequence of SEQ ID NO: 12 (e.g., comprising a nucleotide sequence of 
SEQ ID NO: 11). The cell can be isolated or derived from duodenum, gall bladder, 
liver, small intestine or stomach tissue. The cell can be an enterocyte. 

20 BRIEF PESCTTPHTTON OF THE FIGURES 

Figure 1 A shows an equilibrium saturation binding plot exhibiting the 

binding of 3 H-EZE-glucuronide to rhesxos brush border membrane (BBM) vesicles. 

Observed total binding (Total) is shown, as open circles; nonspecific binding (NS) as 

triangles, and specific binding (S-bind) as solid circles. 
25 Figure IB shows a scatcliard analysis of 3 H-EZE-glucuronide binding 

to rhesus brush border membrane vesicles. 

Figure 2A shows an equilibrium saturation binding plot exhibiting the 

binding of ^-EZE-glucuronide Q) to rat brush border membrane vesicles. Observed 

total binding (open circles) and nonspecific binding (triangles), determined in the 
30 presence of 100 yM unlabeled ezetimib e glucuronide, are included; specific binding 

(solid circles) was assessed from the difference between total and nonspecific binding. 
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Binding was measured at 2.5 mg protein/ml in. a volume of 100 pi after 1 hour 
incubation. Data were fit by nonlinear regression as described in Methods. 

Figure 2B shows scatchard analysis of 3 H-EZE-glucuronide binding to 
rat brush border membrane vesicles. The binding data identify a singje high-affinity 
5 site with K D = 542 nM and Bmax = 20.7 pmol/mg protein. 

Figure 3 A shows association kinetic analysis of H-EZE-glucuronide 
in rat brush border membrane vesicles. Conditions were 25 nM of 1 and 3 mg/ml 
protein at 25°C. The second-order rate constant kon (0.55 x 10" 4 NT 1 s" 1 ) was 
calculated from kot* (0.004 s" 1 ) as described in Methods. 
10 Figure 3B shows dissociation kinetic analysis of 3 H-EZE-glucuronide 

1 in rat brush border membrane vesicles. After the complex was formed by 
incubating 25 nM of 1 and 3 mg/ml protein for 1 hour, dissociation was initiated by 
competition with 100 jiM unlabeled ezetimibe glucuronide. The curve is theoretical 
for ko ff = 0.0024 s" 1 . 

1 5 Figure 4A shows association kinetic analysis of H-EZE-glucuronide 

in rhesus brush border membrane vesicles. 

Figure 4B shows dissociation kinetic analysis of 3 H-EZE-glucuronide 

in rhesus brush border membrane vesicles. 

Figure 5 shows the results of a binding assay where 3 H-EZE- 
20 glucuronide is dissociated by EZE-glucuronide and compound 2 from rhesus (A) and 
rat (B) brush border membrane vesicles. 

Figure 6 shows the results of a binding assay where 35 S-2 is dissociated 
by EZE-glucuronide and 2 from mouse brush border membrane vesicles. 

Figure 7 shows the distribution of 3 H-EZE-glucuronide binding to 
25 rhesus (A) and rat (B) brush border membranes prepared from various portions of 
rhesus (A) and rat (B) intestinal tissue. 

Figure 8 shows the results of a binding assay where 35 S-2 is dissociated 
by EZE-glucuronide and various analogs from CHO cells transfected with rat 
NPC1L1. 

30 Figure 9 shows the results of a binding assay where 35 S-2 is dissociated 

by EZE-glucuronide and various analogs from CHO cells transfected with human 
NPC1L1. 

-8- 
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Figure 1 0 shows the binding of S-2 to brush border membrane 
vesicles prepared from wild type (A) and NPC1L1 knockout (-/-) mice. 

Figure 1 1 shows the results of a binding assay where 35 S-2 is 
dissociated by compound 2 from mouse wild type and NPC1L1 knockout (-/-) brush 
5 border membrane vesicles. 

Figure 12A shows equilibrium determination of Kd for ezetimibe 
glucuronide by competition of unlabeled compound against 1 in rat enterocyte brush 
border membranes. Membranes (1 .5 mg/ml protein) were incubated with I (50 nM) 
and the indicated concentrations of ezetimibe glucuronide for 1 hour to ensure 
10 equilibrium. Kd at equilibrium is 600 nM. Figure 12B shows the corresponding 

measurement for rhesus monkey, which were conducted between 0.5 and 1 .25 mg/ml 
protein and 22-50 nM i, with incubation time of more than 3 hours. K© at 
• equilibrium is 38.6 nM. 



and mouse, along with polynucleotides encoding the respective polypeptides. 

25 Preferably, the rat NPC1L1 polypeptide comprises the amino acid sequence set forth 
in SEQ ID NO: 2, the human NPC1L1 comprises the amino acid sequence set forth in 
SEQ ID NO: 4 and the mouse NPC1L1 polypeptide comprises the amino acid 
sequence set forth in SEQ ID NO: 12. The rat NPC1L1 polynucleotide of SEQ ID 
NO: 1 or 10 encodes the rat NPC1 LI polypeptide. The human NPC1L1 

30 polynucleotide of SEQ ID NO: 3 encodes the human NPC1L1 polypeptide. The 

mouse NPC1L1 polynucleotide of SEQ ID NO: 11 or 13 encodes the mouse NPC1 LI 
polypeptide. 



20 



15 




DETAILED DESCRIPTION OF THE INVENTION 



The present invention includes NPC1L1 polypeptides from rat, human 
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The present invention includes any isolated polynucleotide or isolated 
polypeptide comprising a nucleotide or amino acid sequence referred to, below, in 
Table 1. 



Table 1. Polynucleotides and Polypeptides of the Invention. 



Polynucleotide or Polypeptide 


Sequence Identifier 


Rat NPC1L1 polynucleotide 


SEQ ID NO: 1 


RdXNPClLl polypeptide 


SEQ ID NO: 2 


Human NPC1 LI ool vnucleotide 


SEO ID NO: 3 


Human NPC1L1 polypeptide 


SEQ ID NO: 4 


Rat NPC1T*1 extwessed <;eouence tafr 

AVAL J li J J—i J. VnLfl WUuVU OVUUVUW O 

603 66208 0F1 foartial seauence^ 


SEO ID NO: 5 


Rat NPC1L1 exDressed seauence tan 
603665037F1 (partial sequence) 


SEQ ID NO: 6 


Rat NPC1L1 expressed seauence tas 
604034587F1 (partial sequence) 


SEQ ID NO: 7 


EST 603662080F1 with downstream 
sequences added 


SEQ ID NO: 8 


EST 603662080F1 with upstream and 
downstream sequences added 


SEQ ID NO: 9 


Back-translated polynucleotide sequence of 
xatNPClLl 


SEQ ID NO: 10 


Mouse NPC1 LI polynucleotide 


SEQ ID NO: 1 1 


Mouse NPC1 LI polypeptide 


SEQ ID NO: 12 


Back-translated polynucleotide sequence of 
mouse NPC1 LI 


SEQ ID NO: 13 


Back-translated polynucleotide sequence of 
human NPC1L1 


SEQ ID NO: 51 



5 

A human NPC1L1 is also disclosed under Genbank Accession Number 



AF 192522. As discussed below, the nucleotide sequence of the ratNPClLl set forth 
in SEQ ID NO: 1 was obtained from an expressed sequence tag (EST) from a rat 
jejunum enterocyte cDNA library. SEQ ID NOs: 5-7 include partial nucleotide 
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sequences of three independent cDNA clones. The downstream sequence of the SEQ 
ID NO: 5 EST (603662080F1) were determined; the sequencing data from these 
experiments are set forth in SEQ ID NO: 8. The upstream sequences were also 
determined; these data are set forth in SEQ ID NO: 9. 
5 SEQ ID NOs: 43 and 44 are the nucleotide and amino acid sequence, 

respectively, of human NPC1L1 which is disclosed under Genbank Accession No.: 
AF192522 (see Davies, et aL 9 (2000) Genomics 65(2): 137-45). 

SEQ ID NO: 45 is the nucleotide sequence of a mouse NPC1L1 which 
is disclosed under Genbank Accession No. AK078947. 

10 NPC1L1 mediates intestinal sterol (e.£., cholesterol) or 5a-stanol 

absorption. Inhibition of NPC1L1 in a patient is a useful method for reducing 
intestinal sterol (e.g. 9 cholesterol) or 5a-stanol absorption and serum sterol (e.g 9 
cholesterol) or 5a-stanol in the patient. Reducing the level of intestinal sterol (e.g., 
cholesterol) or 5a-stanol absorption and serum sterol (e.g. 9 cholesterol) or 5a-stanol 

15 in a patient is a useful way in which to treat or prevent the occurrence of 
atherosclerosis, particularly diet-induced atherosclerosis. 

As used herein, the term "sterol" includes, but is not limited to, 
cholesterol and phytosterols (including, but not limited to, sitosterol, campesterol, 
stigmasterol and avenosterol). 

20 As used herein, the term "5ct-stanol" includes, but is not limited to, 

cholestanol, 5a-campestanol and 5a-sitostanol. 

Without being limited by the present hypothesis, the examples present 
a better understanding of the putative molecular interaction between NPC1L1 and 
cholesterol. In this regard, one of the more interesting features of NPC1L1 is that it 

25 contains the sterol-sensing domain (SSD) originally observed in SCAP (SREBP 

cleavage-activating protein). SCAP controls activation of sterol regulatory element 
binding proteins (SREBP), a transcription factor which controls more than 35 genes 
related to lipid and cholesterol homeostasis (Brown, M.S. & Goldstein, J.L. A 
proteolytic pathway that controls the cholesterol content of membranes, cells, and 

30 blood. Proc. Natl Acad. ScL U.S.A. 96, 1 1041-1 1048 (1999)). The SSD, consisting 
of ~1 80 amino acids in a packet of 5 putative membrane-spanning helices, also serves 
a regulatory function in two key enzymes on the cholesterol biosynthesis pathway and 
is present in the receptor Patched. Recently, high affinity binding of cholesterol to the 
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SSD on SCAP has been demonstrated (Radhakrishnan, A., Sun, L., Kwon, H.J., 
Brown, M.S. & Goldstein, J.L., "Direct binding of cholesterol to the purified 
membrane region of SCAP: Mechanism for a sterol-sensing domain," MoL Cell 15, 
259-268 (2004)), suggesting that cholesterol may similarly bind to the SSD of 
5 NPC1 LI , and raising the possibility that ezetimibe may compete with cholesterol for 
binding at this site. 

Molecular Biology 

In accordance with the present invention there may be employed 
conventional molecular biology, microbiology, and recombinant DNA techniques 

10 within the skill of the art. Such techniques are explained fully in the literature. See, 
e.g. , Sambrook, Fritsch & Maniatis, Molecular Cloning: X Laboratory Manual, 
Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New Yoik (herein "Sambrook, et al y 1989"); DNA Cloning: A Practical Approach, 
Volumes I and II (D.N. Glover ed. 1985); Oligonucleo tide Synthesis (MJ. Gait, ed. 

15 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins, eds. (1985)); 

Transcriptio n And Translation (B.D. Hames & S.J. Higgins, eds. (1984)); Animal Cell 
Culture (R.I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press, 
(1986)); B. Perbal, A Practical Guide To Molecular Cloning (1984); F.M. Ausubel, 
et al (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

20 The back-translated sequences of SEQ ED NO: 1 0 and of SEQ ID NO: 

13 uses the single-letter code shown in Table 1 of Annex C, Appendix 2 of the PCT 
Administrative Instruction in the Manual of Patent Examination Procedure. 

A "polynucleotide", "nucleic acid " or "nucleic acid molecule" may 
refer to the phosphate ester polymeric form of ribonucleosides (adenosine, guanosine, 

25 uridine or cytidine; "RNA molecules") or deoxyribonucleosides (deoxyadenosine, 
deoxyguanosine, deoxythymidine, or deoxycytidine; "DNA molecules"), or any 
phosphoester analogs thereof; such as phosphorothioateS and thioesters, in single 
stranded form, double-stranded form or otherwise. 

A "polynucleotide sequence", "nucleic acid sequence" or "nucleotide 

30 sequence" is a series of nucleotide bases (also called "nucleotides") in a nucleic acid, 
such as DNA or RNA, and means any chain of two or more nucleotides. 
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A "coding sequence" or a sequence "encoding" an expression product, 
such as a RNA, polypeptide, protein, or enzyme, is a nucleotide sequence that, when 
expressed, results in production of the product. 

The term "gene" means a DNA sequence that codes for or corresponds 
5 to a particular sequence of ribonucleotides or amino acids which comprise all or part 
of one or more RNA molecules, proteins or enzymes, and may or may not include 
regulatory DNA sequences, such as promoter sequences, which determine, for 
example, the conditions under which the gene is expressed. Genes may be transcribed 
from DNA to RNA which may or may not be translated into an amino acid sequence. 

1 0 The present invention includes nucleic acid fragments of any of SEQ 

ID NOs: 1,5-11 or 13. A nucleic acid "fragment" includes at least about 30 (e.g., 31, 
32, 33, 34), preferably at least about 35 (e.g, 25, 26, 27, 28, 29, 30, 31, 32, 33 or 34), 
more preferably at least about 45 (e.g., 35, 36, 37, 38, 39, 40, 41, 42, 43 or 44), and 
most preferably at least about 126 or more contiguous nucleotides (e.g., 130, 131, 

15 132, 133, 134, 135, 136, 137, 138, 139, 140, 150, 160, 170, 180, 190, 200, 300, 400, 
500, 1000 or 1200) from any of SEQ ID NOs: 1, 5-11 or 13. 

The present invention also includes nucleic acid fragments consisting 
of at least about 7 (e.g., 9, 12, 17, 19), preferably at least about 20 (e.g. y 30, 40, 50, 
60), more preferably about 70 (e.g., 80, 90, 95), yet more preferably at least about 100 

20 (e.g., 105, 1 10, 1 14) and even more preferably at least about 1 1 5 (e.g, 1 17, 1 19, 120, 
122, 124, 125, 126) contiguous nucleotides from any of SEQ ID NOs: 1, 5-11 or 13. 

, As used herein, the term "oligonucleotide" refers to a nucleic acid, 
generally of no more than about 100 nucleotides (e.g., 30, 40, 50, 60, 70, 80, or 90), 
that may be hybridizable to a genomic DNA molecule, a cDNA molecule, or an 

25 mRNA molecule encoding a-gene, mRNA, cDNA, or other nucleic acid of interest 
Oligonucleotides can be labeled, e.g., by incorporation of 32 P-nucleotides, 3 H- 
nucleotides, ,4 C-nucleotides, 35 S-nucleotides or nucleotides to which a label, such as 
biotin, has been covalently conjugated. In one embodiment, a labeled oligonucleotide 
can be used as a probe to detect the presence of a nucleic acid. In another 

30 embodiment, oligonucleotides (one or both of which may be labeled) can be used as 
PCR primers, either for cloning full length or a fragment of the gene, or to detect the 
presence of nucleic acids. Generally, oligonucleotides are prepared synthetically, 
preferably on a nucleic acid synthesizer. 
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A "protein sequence", '^peptide sequence" or "polypeptide sequence" 
or "amino acid sequence" may refer to a series of two or more amino acids in a 
protein, peptide or polypeptide. 

"Protein", "peptide" or "polypeptide" includes a contiguous string of 
5 two or more amino acids. Preferred peptides of the invention include those set forth 
in any of SEQ ID NOs: 2 or 12 as well as variants and fragments thereof. Such 
fragments preferably comprise at least about 10 (e.g., 1 1, 12, 13, 14, 15, 16, 17, 18 or 
19), more preferably at least about 20 (eg., 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 
40), and yet more preferably at least about 42 (e.g., 43, 44, 45, 46, 47, 48, 49, 50, 60, 
10 70, 80, 90, 100, 1 10, 120 or 130) or more contiguous amino acid residues from any of 
SEQ ID NOs: 2 or 12. 

The present invention also includes polypeptides, preferably antigenic 
polypeptides, consisting of at least about 7 (e.g. 7 9, 10, 13, 1 5, 17, 19), preferably at 
least about 20 (e.g., 22, 24, 26, 28), yet more preferably at least about 30 (e.g., 32, 34, 
15 36, 38) and even more preferably at least about 40 (e.g., 41, 42) contiguous amino 
acids from any of SEQ ID NOs: 2 or 12. 

The polypeptides of the invention can be produced by proteolytic 
cleavage of an intact peptide, by chemical synthesis or by the application of 
recombinant DNA technology and are not limited to polypeptides delineated by 
20 proteolytic cleavage sites. The polypeptides, either alone or cross-linked or 

conjugated to a carrier molecule to raider them more immunogenic, are useful as 
antigens to elicit the production of antibodies and fragments thereof. The antibodies 
can be used, e.g., in immunoassays for iromunoaffinity purification or for inhibition of 
NPC1L1, etc. 

25 The terms "isolated polynucleotide" or "isolated polypeptide" include 

a polynucleotide (e.g., RNA or DNA molecule, or a mixed polymer) or a polypeptide, 
respectively, which are partially or fully separated from other components that are 
normally found in cells or in recombinant DNA expression systems. These 
components include, but are not limited to, cell membranes, cell walls, ribosomes, 

30 polymerases, serum components and extraneous genomic sequences. 

An isolated polynucleotide or polypeptide will, preferably, be an 
essentially homogeneous composition of molecules but may contain some 
heterogeneity. 

♦ 
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"Amplification** of DNA as used herein may denote the use of 
polymerase chain reaction (PCR) to increase the concentration of a particular DNA 
sequence within a mixture of DNA sequences. For a description of PCR see Saiki, et 
aL, Science (1988) 239: 487. 
5 The term "host cell" includes any cell of any organism that is selected, 

modified, transfected, transformed, grown, or used or manipulated in any way, for the 
production of a substance by the cell, for example, the expression or replication, by 
the cell, of a gene, a DNA or RNA sequence or a protein. Preferred host cells include 
HEK-293 cells, Chinese hamster ovary (CHO) cells, murine macrophage J774 cells or 

10 any other macrophage cell line and human intestinal epithelial Caco2 cells. 

The nucleotide sequence of a nucleic acid may be determined by any 
method known in the art (e.g., chemical sequencing or enzymatic sequencing). 
"Chemical sequencing" of DNA includes methods such as that of Maxam and Gilbert 
(1977) (Proc. Natl. Acad. Sci. USA 74: 560), in which DNA is randomly cleaved 

1 5 using individual base-specific reactions. "Enzymatic sequencing" of DNA includes 
methods such as that of Sanger (Sanger, et al^ (1977) Proc. Natl. Acad. Sci. USA 74: 
5463). 

The nucleic acids herein may be flanked by natural* regulatory 
(expression control) sequences, or may be associated with heterologous sequences, 

20 including promoters, internal ribosome entry sites (IRES) and other ribosome binding 
site sequences, enhancers, response elements, suppressors, signal sequences, 
polyadenylation sequences, introns, 5'- and 3'- non-coding regions, and the like. 

In general, a "promoter" or "promoter sequence" is a DNA regulatory 
region capable of binding an RNA polymerase in a cell (e.g., directly or through other 

25 promoter-bound proteins or substances) and initiating transcription of a coding 
sequence. A promoter sequence is, in general, bounded at its 3' terminus by the 
transcription initiation site and extends upstream (5' direction) to include the 
minimum number of bases or elements necessary to initiate transcription at any level. 
Within the promoter sequence may be found a transcription initiation site 

30 (conveniently defined, for example, by mapping with nuclease SI), as well as protein 
binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. The promoter may be operably associated with other expression control 
sequences, including enhancer and repressor sequences or with a nucleic acid of the 
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invention. Promoters which may be used to control gene expression include, but are 
not limited to, cytomegalovirus (CMV) promoter (U.S. Patent Nos. 5,385,839 and 
5,168,062), the SV40 early promoter region (Benoist, et aL, (1981) Nature 290: 304- 
3 1 0), the promoter contained in the 3 * long terminal repeat of Rous sarcoma virus 
5 (Y amamoto, et aL, (1 980) Cell 22: 787-797), the herpes thymidine kinase promoter 
(Wagner, et aL, (1981) Proc. NatL Acad. Sci. USA 78: 1441-1445), the regulatory 
sequences of the metallothionein gene (Brinster, et aL, (1982) Nature 296: 39-42); 
prokaryotic expression vectors such as the P-lactamase promoter (Villa-Komaroff, 
et aL, (1978) Proc. Natl. Acad. Sci. USA 75: 3727-3731), or the tac promoter 

10 (DeBoer, et aL, (1983) Proc. Natl. Acad Sci. USA 80: 21-25); see also "Useful 
proteins from recombinant bacteria" in Scientific American (1980) 242: 74-94; and 
promoter elements from yeast or other fungi such as the Gal 4 promoter, the ADC 
(alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter or the 
alkaline phosphatase promoter. 

15 A coding sequence is * 'under the control of", "functionally associated 

with" or "operably associated with" transcriptional and translation^ control 
^ sequences in a cell when the sequences direct RNA polymerase mediated 

transcription of the coding sequence into RNA, preferably mRNA, which then may be 
RNA spliced (if it contains introns) and, optionally, translated into a protein encoded 

20 by the coding sequence. 

The terms "express" and "expression" mean allowing or causing the 
information in a gene, RNA or DNA sequence to become manifest; for example, 
producing a protein by activating the cellular functions involved in transcription and 
translation of a corresponding gene. A DNA sequence is expressed in or by a cell to 

25 form an "expression producf * such as an RNA (e.g., mRNA) or a protein. The 
expression product itself may also be said to be "expressed" by the cell. 

The term "transformation" means the introduction of a nucleic acid 
into a cell. The introduced gene or sequence may be called a "clone". A host cell that 
receives the introduced DNA or RNA has been "transformed" and is a "transformant" 

30 or a "clone." The DNA or RNA introduced to a host cell can come from any source, 
including cells of the same genus or species as the host cell, or from cells of a 
different genus or species. 



16- 



WO 2005/069900 



PCT/US2005/001469 



The term "vector" includes a vehicle (e.g, a plasmid) by which a DNA 
or KNA sequence can be introduced into a host cell, so as to transform the host and, 
optionally, promote expression and/or replication of the introduced sequence. 

Vectors that can be used in this invention include plasmids, viruses, 
5 bacteriophage, integratable DNA fragments, and other vehicles that may facilitate 
introduction of the nucleic adds into the genome of the host Plasmids are the most 
commonly used form of vector but all other forms of vectors which serve a similar 
function and which are, or become, known in the art are suitable for use herein. See, 
e.g, Pouwels, et al., Cloning Vectors: A Lab oratory Manual 1985 and Supplements, 
1 0 Elsevier, N.Y., and Rodriguez et al. (eds.), Vectors: A Survey of Molecular C loning 
Vectors and Their Uses, 1988, Buttersworth, Boston, MA. 

The term "expression system" means a host cell and compatible vector 
which, under suitable conditions, can express a protein or nucleic acid which is 
carried by the vector and introduced to the host cell. Common expression systems 
1 5 include E. coli host cells and plasmid vectors, insect host cells and Baculovirus 
vectors, and mammalian host cells and vectors* 

Expression of nucleic acids encoding theNPClLl polypeptides of this 
invention can be carried out by conventional methods in either prokaryotic or 
eukaryotic cells. Although E. coli host cells are employed most frequently in 
20 prokaryotic systems, many other bacteria, such as various strains of Pseudomonas and 
Bacillus, are known in the art and can be used as well. Suitable host cells for 
expressing nucleic acids encoding the NPC1L1 polypeptides include prokaryotes and 
higher eukaryotes. Prokaryotes include both gram-negative and gram-positive 
organisms, e.g., E. coli and B. subtilis. Higher eukaryotes include established tissue 
25 culture cell lines from animal cells, both of non-mammalian origin, e.g., insect cells, 
and birds, and of mammalian origin, e.g., human, primates, and rodents. 

Prokaryotic host-vector systems include a wide variety of vectors for 
many different species. A representative vector for amplifying DNA is pBR322 or 
many of its derivatives {e.g., pUC18 or 19). Vectors that can be used to express the 
30 NPC1L1 polypeptides include, but are not limited to, those containing the lac 

promoter (pUC-series); trp promoter (pBR322-fcp); Ipp promoter (the pIN-series); 
lambda-pP or pR promoters (pOTS); or hybrid promoters such asptac (pDR540). See 
Brosius et al, "Expression Vectors Employing Lambda-, trp-, lac-, and ipp-derived 
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Promoters" in Rodriguez and Denhardt (eds.) Vectors: A Survey of Molecular 
Cloning Vectors and Their Uses, 1988, Buttersworth, Boston, pp. 205-236. Many 
polypeptides can be expressed, at high levels, in an E.colifn expression system as 
disclosed in U.S. Patent Nos. 4,952,496; 5,693,489 and 5,869,320 and in Davanloo, 
5 P., et aL, (1984) Proc. Natl. Acad. ScL USA 81 : 2035-2039; Studier, F.W., et al 9 

(1986) J. Mol. Biol. 189: 113-130; Rosenberg, A. H., etaL, (1987) Gene 56: 125-135; 
and Dunn, J.J., et aL, (1988) Gene 68: 259. 

Higher eukaryotic tissue culture cells may also be used for the 
recombinant production of the NPC1L1 polypeptides of the invention. Although any 

1 0 higher eukaryotic tissue culture cell line might be used, including insect baculovirus 
expression systems, mammalian cells are preferred. Transformation or transfection 
and propagation of such cells have become a routine procedure. Examples of useful 
cell lines include HeLa cells, Chinese hamster ovary (CHO) cell lines, J774 cells, 
HEK-293 cells, Caco2 cells, baby rat kidney (BRK) cell lines, insect cell lines, bird 

1 5 cell lines, and monkey (COS) cell lines. Expression vectors for such cell lines usually 
include an origin of replication, a promoter, a translation initiation site, RNA splice 
sites (if genomic DNA is used), a polyadenylation site, and a transcription termination 
site. These vectors also, usually, contain a selection gene or amplification gene. 
Suitable expression vectors may be plasmids, viruses, or retroviruses carrying 

20 promoters derived, e.g., from such sources as adenovirus, SV40, parvoviruses, 
vaccinia virus, or cytomegalovirus. Examples of expression vectors include 
pCR®3.1, pCDNAl, pCD (Okayama, et al, y (1985) Mol. Cell Biol. 5: 1 136), 
pMClneo Poly-A (Thomas, et aL, (1987) Cell 51: 503), pREP8, pSVSPORT and 
derivatives thereof, and baculovirus vectors such as pAC373 or pAC610. One 

25 embodiment of the invention includes membrane bound NPC1L1. In this 

embodiment, NPC1L1 can be expressed in the cell membrane of a eukaryotic cell and 
the membrane bound protein can be isolated from the cell by conventional methods 
which are known in the art. 

The present invention also includes fusions which include the NPC1L1 

30 polypeptides and NPC1L1 polynucleotides of the present invention and a second 
polypeptide or polynucleotide moiety, which may be referred to as a "tag". The 
fusions of the present invention may comprise any of the polynucleotides or 
polypeptides set forth in Table 1 or any subsequence or fragment thereof (discussed 
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above). The fused polypeptides of the invention may be conveniently constructed, for 
example, by insertion of a polynucleotide of the invention or fragment thereof into an 
expression vector. The fusions of the invention may include tags which facilitate 
purification or detection. Such tags include glutathione-S-transferase (GST), 
5 hexahistidine (His6) tags, maltose binding protein (MB?) tags, haemagglutinin (HA) 
tags, cellulose binding protein (CBP) tags and myc tags. Detectable tags such as 32 P, 
35 S, 3 H, ""Tc, ,23 I, u, In, 68 Ga, 18 F, l2 \ 13, I, U3m In, 76 Br, 67 Ga, ""Tc, 123 I, in In and 
68 Ga may also be used to label the polypeptides and polynucleotides of the invention. 
Methods for constructing and using such fusions are very conventional and well 

10 known in the art. 

Modifications (c#., post-translational modifications) that occur in a 
polypeptide often will be a function of how it is made. For polypeptides made by 
expressing a cloned gene in a host, for instance, the nature and extent of the 
modifications, in large part, will be determined by the host cell's post-translational 

15 modification capacity and the modification signals present in the polypeptide amino 
acid sequence. For instance, as is well known, glycosylation often does not occur in 
bacterial hosts such as E. colL Accordingly, when glycosylation is desired, a 
polypeptide can be expressed in a glycosylating host, generally a eukaryotic cell. 
Insect cells often carry out post-translational glycosylations which are similar to those 

20 of mammalian cells. For this reason, insect cell expression systems have been 
developed to express, efficiently, mammalian proteins having native patterns of 
glycosylation. An insect cell which may be used in this invention is any cell derived 
from an organism of the class Insect a. Preferably, the insect is Spodoptera 
fruigiperda (Sf9 or S£21) or Trichoplusia ni (High 5). Examples of insect expression 

25 systems that can be used with the present invention, for example to produce NPC1L1 
polypeptide, include Bac-To-Bac (Invitrogen Corporation, Carlsbad, CA) or Gateway 
(Invitrogen Corporation, Carlsbad, CA). If desired, deglycosylation enzymes can be 
used to remove carbohydrates attached during production in eukaryotic expression 
systems. 

30 Other modifications may also include addition of aliphatic esters or 

amides to the polypeptide carboxyl terminus. The present invention also includes 
analogs of the NPC1L1 polypeptides which contain modifications, such as 
incorporation of unnatural amino acid residues, or phosphorylated amino acid 
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residues such as phosphotyrosine, phosphoserine or phosphothreonine residues. 
Other potential modifications include sulfonation, biotinylation, or the addition of 
other moieties. For example, the NPC1L1 polypeptides of the invention may be 
appended with a polymer which increases the half-life of the peptide in the body of a 
5 subject. Preferred polymers include polyethylene glycol (PEG) (e.g. , PEG with a 
molecular weight of 2 kDa, 5 kDa, 10 kDa, 12 kDa, 20 kDa, 30 kDa and 40 kDa), 
dextran and monomethoxypolyethylene glycol (mPEG). 

The peptides of the invention may also be cyclized. Specifically, the 
amino- and carboxy-terminal residues of an NPC1L1 polypeptide or two internal 
10 residues of an NPC1L1 polypeptide of the invention can be fused to create a cyclized 
peptide. Methods for cyclizing peptides are conventional and very well known in the 
art; for example, see Gurrath, et aL y (1992) Eur. J. Biochem. 210: 911-921. 

The present invention contemplates any superficial or slight 
modification to the amino acid or nucleotide sequences which correspond to the 
15 polypeptides of the invention. In particular, the present invention contemplates 

sequence conservative variants of the nucleic acids which encode the polypeptides of 
the invention. "Sequence-conservative variants" of a polynucleotide sequence are 
those in which a change of one or more nucleotides in a given codon results in no 
alteration in the amino acid encoded at that position. Function-conservative variants 
20 of the polypeptides of the invention are also contemplated by the present invention. 
"Function-conservative variants" are those in which one or more amino acid residues 
in a protein or enzyme have been changed without altering the overall conformation 
and function of the polypeptide, including, but, by no means, limited to, replacement 
of an amino acid with one having similar properties. Amino acids with similar 
25 properties are well known in the art. For example, polar/hydrophilic amino acids 
which may be interchangeable include asparagine, glutamine, serine, cysteine, 
threonine, lysine, arginine, histidine, aspartic acid and glutamic acid; 
nonpolar/hydrophobic amino acids which may be interchangeable include glycine, 
alanine, valine, leucine, isoleucine, proline, tyrosine, phenylalanine, tryptophan and 
30 methionine; acidic amino acids, which may be interchangeable include aspartic acid 
and glutamic acid and basic amino acids, which may be interchangeable include 
histidine, lysine and arginine. 
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The present invention includes polynucleotides encoding rat, human or 
mouse NPC1L1 and fragments thereof as well as nucleic acids which hybridize to the 
polynucleotides. Preferably, the nucleic acids hybridize under low stringency 
conditions, more preferably under moderate stringency conditions and most 
preferably under high stringency conditions. A nucleic acid molecule is 
"hybridizable" to another nucleic acid molecule, such as a cDNA, genomic DNA, or 
RNA, when a single stranded form of the nucleic acid molecule can anneal to the 
other nucleic acid molecule under the appropriate conditions of temperature and 
solution ionic strength (see Sambrook, et aL, supra). The conditions of temperature 
and ionic strength determine the "stringency" of the hybridization. Typical low 
stringency hybridization conditions are 55°C, 5X SSC, 0.1% SDS, 0.25% millr, and 
no formamide at 42°C; or 30% formamide, 5X SSC, 0.5% SDS at 42°C. Typical, 
moderate stringency hybridization conditions are similar to the low stringency 
conditions except the hybridization is carried out in 40% formamide, with 5X or 6X 
SSC at 42°C. High stringency hybridization conditions are similar to low stringency 
conditions except the hybridization conditions are carried out in 50% formamide, 5X 
or 6X SSC and, optionally, at a higher temperature (e.g., higher than 42°C: 57°C, 
59°C, 60°C, 62°C, 63°C, 65°C or 68°C). In general, SSC is 0.15M NaCl and 0.01 5M 
Na-citrate. Hybridization requires that the two nucleic acids contain complementary 
sequences, although, depending on the stringency of the hybridization, mismatches 
between bases are possible. The appropriate stringency for hybridizing nucleic acids 
depends on the length of the nucleic acids and the degree of complementation, 
variables well known in the art The greater the degree of similarity or homology 
between two nucleotide sequences, the higher the stringency under which the nucleic 
acids may hybridize. For hybrids of greater than 1 00 nucleotides in length, equations 
for calculating the melting temperature have been derived (see Sambrook, et aL, 
supra, 9.50-9.51). For hybridization with shorter nucleic acids, i.e., oligonucleotides, 
the position of mismatches becomes more important, and the length of the 
oligonucleotide determines its specificity (see Sambrook, et al 9 supra). 

Also included in the present invention are polynucleotides comprising 
nucleotide sequences and polypeptides comprising amino acid sequences which are at 
least about 70% identical, preferably at least about 80% identical, more preferably at 
least about 90% identical and most preferably at least about 95% identical {e.g., 95%, 
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96%, 97%, 98%, 99%, 100%) to the reference mtNPClLl nucleotide (e.g. 9 any of 
SEQ ID NOs: 1 or 5-10) and amino acid sequences (e.g., SEQ ID NO: 2), reference 
human NPC1LJ nucleotide (e.g., SEQ ED NO: 3) and amino acid sequences (e.g. 9 
SEQ ID NO: 4) or the reference mouse NPC1L1 nucleotide (e.g. 9 any of SEQ ID 
5 NOs: 11 or 13) and amino acid sequences (e.g. 9 SEQ ID NO: 12), when the 
comparison is performed by a BLAST algorithm wherein the parameters of the 
algorithm are selected to give the, largest match between the respective sequences over 
the entire length of the respective reference sequences. Polypeptides comprising 
amino acid sequences which are at least about 70% similar, preferably at least about 

1 0 80% similar, more preferably at least about 90% similar and most preferably at least 
about 95% similar (e.g. 9 95%, 96%, 97%, 98%, 99%, 100%) to the reference rat 
NPC1 LI amino acid sequence of SEQ ID NO: 2, reference human NPC1L1 amino 
acid sequence of SEQ ID NO: 4 or the reference mouse NPC1L1 amino acid sequence 
of SEQ ID NO: 12, when the comparison is performed with a BLAST algorithm 

1 5 wherein the parameters of the algorithm are selected to give the largest match 

between the respective sequences over the entire length of the respective reference 
sequences, are also included in the present invention. 

Sequence identity refers to exact matches between the nucleotides or 
amino acids of two sequences which are being compared. Sequence similarity refers 

20 to both exact matches between the amino acids of two polypeptides which are being 
compared in addition to matches between nonidentical, biochemically related amino 
acids. Biochemically related amino acids which share similar properties and may be 
interchangeable are discussed above. 

The following references regarding the BLAST algorithm are herein 

25 incorporated by reference: BLAST ALGORITHMS: Altschul, S.F., et al. 9 (1990) J. 
Mol. Biol. 215: 403-410; Gish, W., et al 9 (1993) Nature Genet 3: 266-272; Madden, 
T.L., etal. y (1996) Meth. Enzymol. 266: 131-141; Altschul, S.F., etal. 9 (1997) 
Nucleic Acids Res. 25: 3389-3402; Zhang, J., et al. 9 (1997) Genome Res. 7: 649-656; 
Wootton, J.C., et aL 9 (1993) Comput Chem. 17: 149-163; Hancock, J.M., et al. y 

30 (1994) Comput. Appl. Biosci. 10: 67-70; ALIGNMENT SCORING SYSTEMS: 
Dayhoff, M.O., et aL 9 "A model of evolutionary change in proteins" in Adas of 
Protein Sequence and Structure. (1978) vol. 5, suppl. 3. M.O. Dayhoff (ed.), pp. 345- 
352, Natl. Biomed. Res. Found., Washington, DC; Schwartz, R.M., et al. 9 "Matrices 
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for detecting distant relationships" in Atlas of Protein Sequence and Structure. (1978) 
vol. 5, suppi: 3. M.O. Dayhoff (ed.), pp. 353-358, Natl. Biomed Res. Found., 
Washington, DC; Altschul, S.F., (1991) J. Mol. Biol. 219: 555-565; States, D.J., etal., 
(1991) Methods 3: 66-70; Henikoff, S., etal. 9 (1992) Proa Natl. Acad. Sci. USA 89: 
5 10915-10919; Altschul, SJF., etal, (1993) J. Mol. Evol. 36: 290-300; ALIGNMENT 
STATISTICS: Karlin, S., et al, (1990) Proc. Natl. Acad. Sci. USA 87: 2264-2268; 
Karlin, S., et al., (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877; Dembo, A., et 
aL 3 (1994) Ann. Prob. 22: 2022-2039; and Altschul, S.F. "Evaluating the statistical 
significance of multiple distinct local alignments'* in Theoretical and Computational 
10 Methods in Genome Research (S. Suhai, ed.), (1997) pp. 1-14, Plenum, New York. 

Protein Purification 

The proteins, polypeptides and antigenic fragments of this invention 
can be purified by standard methods, including, but not limited to, salt or alcohol 
precipitation, affinity chromatography (e.g, used in conjunction with a purification 

15 tagged NPC1L1 polypeptide as discussed above), preparative disc-gel electrophoresis, 
isoelectric focusing, high pressure liquid chromatography (HPLC), reversed-phase 
HPLC, gel filtration, cation and anion exchange and partition chromatography, and 
countercuirent distribution. Such purification methods are well known in the art and 
are disclosed, e.g., in "Guide to Protein Purification", Methods in Enzvmology. Vol. 

20 1 82, M. Deutscher, Ed., 1 990, Academic Press, New York, NY. 

Purification steps can be followed by performance of assays for 
receptor binding activity as described below. Particularly where an NPC1L1 
polypeptide is being isolated from a cellular or tissue source, it is preferable to include 
one or more inhibitors of proteolytic enzymes in the assay system, such as 

25 phenylmethanesulfonyl fluoride (PMSF), Pefebloc SC, pepstatin, leupeptin, 
chymostatin and EDTA. 

Antibody Molecules 

Antigenic (including immunogenic) fragments of the NPC1L1 
polypeptides of the invention are within the scope of the present invention (e.g., 42 or 
30 more contiguous amino acids from SEQ ID NO: 2, 4 or 12). The antigenic peptides 
may be useful, inter alia, for preparing isolated antibody molecules which recognize 
NPC1L1. Isolated anti-NPClLl antibody molecules are useful NPC1L1 ligands. 
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An antigen is any molecule that can bind specifically to an antibody. 
Some antigens cannot, by themselves, elicit antibody production. Those that can 
induce antibody production are immunogens. 

Preferably, isolated anti-NPClLl antibodies recognize an antigenic 
peptide comprising an amino acid sequence selected from SEQ ID NOs: 39-42 (e.g. 9 
an antigen derived from rat NPC1L1). More preferably, the antibody is A0715, 
A0716, A0717, A0718, A0867, A0868, A1801 or A1802. 

The term "antibody molecule " includes, but is not limited to, 
antibodies and fragments (preferably antigen-binding fragments) thereof. The term 
includes monoclonal antibodies, polyclonal antibodies, bispecific antibodies, Fab 
antibody fragments, F(ab)2 antibody fragments, Fv antibody fragments (e.g. 9 Vh or 
ViX single chain Fv antibody fragments and dsFv antibody fragments. Furthermore, 
the antibody molecules of the invention may be fully human antibodies, mouse 
antibodies, rat antibodies, rabbit antibodies, goat antibodies, chicken antibodies, 
humanized antibodies or chimeric antibodies. 

Although it is not always necessary, when NPC1L1 polypeptides are 
used as antigens to elicit antibody production in an immunologically competent host, 
smaller antigenic fragments are, preferably, first rendered more immunogenic by 
cross-linking or concatenation, or by coupling to an immunogenic carrier molecule 
(i.e., a macromolecule having the property of independently eliciting an 
immunological response in a host animal, such as diptheria toxin or tetanus). Cross- 
linking or conjugation to a carrier molecule may be required because small 
polypeptide fragments sometimes act as haptens, (molecules which are capable of 
specifically binding to an antibody but incapable of eliciting antibody production, i.e., 
they are not immunogenic). Conjugation of such fragments to an immunogenic 
carrier molecule renders them more immunogenic through what is commonly known 
as the "carrier effect". 

Carrier molecules include, e.g. 9 proteins and natural or synthetic 
polymeric compounds such as polypeptides, polysaccharides, lipopolysaccharides, 
etc. Protein carrier molecules are especially preferred, including, but not limited to, 
keyhole limpet hemocyanin and mammalian serum proteins such as human or bovine 
gammaglobulin, human, bovine or rabbit serum albumin, or methylated or other 
derivatives of such proteins. Other protein carriers will be apparent to those skilled in 
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the art Preferably, the protein carrier will be foreign to the host animal in which 
antibodies against the fragments are to be elicited. 

Covalent coupling to the carrier molecule can be achieved using 
methods well known in the ait, the exact choice of which will be dictated by the 
5 nature of the carrier molecule used. When the immunogenic carrier molecule is a 
protein, the fragments of the invention can be coupled, e.g. 9 using water-soluble 
carbodiimides such as dicyclohexylcaAodiimide or glutaraldehyde. 

Coupling agents, such as these, can also be used to cross-link the 
fragments to themselves without the use of a separate carrier molecule. Such cross- 
10 linking into aggregates can also increase immunogenicity. immunogenicity can also 
be increased by the use of known adjuvants, alone or in combination with coupling or 
aggregation. 

Adjuvants for the vaccination of animals include, but are not limited 
to, Adjuvant 65 (containing peanut oil, mannide monooleate and aluminum 

15 monostearate); Freund's complete or incomplete adjuvant; mineral gels such as 
aluminum hydroxide, aluminum phosphate and alum; surfactants such as 
hexadecylamine, octadecylamine, lysolecithin, dimethyldioctadecylammonium 
bromide, N,N-dioctadecyl-N* r N , -bis(2-hydroxymethyl) propanediamine, 
methoxyhexadecylglycerol and pluronic polyols; polyanions such as pyran, dextran 

20 sulfate, poly IC, polyacrylic acid and carbopol; peptides such as muramyl dipeptide, 
dimethylglycine and tuftsin; and oil emulsions. The polypeptides could also be 
administered following incorporation into liposomes or other microcarriers. 

Information concerning adjuvants and various aspects of 
immunoassays are disclosed, e.g. y in the series by P. Tijssen, Practice an d Theory of 

25 Enzyme Im munoassays, 3rd Edition, 1987, Elsevier, New York. Other useful 

references covering methods for preparing polyclonal antisera include Microbiology. 
1969, Hoeber Medical Division, Harper and Row; Landsteiner, Specificity of 
Serological Reactions. 1962, Dover Publications, New York, and Williams, et aL 9 
Methods in Immunology and Tmrminnchemistrv, Vol. 1, 1967, Academic Press, 

30 New York. 

The anti-NPClLl antibody molecules of the invention preferably 
recognize human, mouse or rat NPC1L1; however, the present invention includes 
antibody molecules which recognize NPC1L1 from any species, preferably mammals 
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(eg., cat, sheep or horse). The present invention also includes complexes comprising 
anNPClLl polypeptide of the invention and an anti-NPClLl antibody molecule. 
Such complexes can be made by simply contacting the antibody molecule with its 
cognate polypeptide. 

Various methods may be used to make the antibody molecules of the 
invention. Human antibodies can be made, for example, by methods which are 
similar to those disclosed in U.S. Patent Nos. 5,625,126; 5,877,397; 6,255,458; 
6,023,010 and 5,874,299. 

Hybridoma cells which produce the monoclonal anti-NPCl LI 
antibodies may t>e produced by methods which are commonly known in the art 
These methods include, but are not limited to, the hybridoma technique originally 
developed by Kohler, et aL 9 (1 975) (Nature 256: 495-497), as well as the trioma 
technique (Hering, et al. 9 (1988) Biomed. Biochim. Acta. 47: 21 1-216 and Hagiwara, 
et aL, (1993) Hum. Antibod. Hybridomas 4: 1 5), the human B-cell hybridoma 
technique (Kozbor, et aL, (1983) Immunology Today 4: 72 and Cote, et aL, (1983) 
Proc. Natl. Acad. Sci. U.S .A 80: 2026-2030), and the EBV-hybridoma technique 
(Cole, et aL, in Monoclonal Antibodies and Cancer Therap y. Alan R. Liss, Inc., 
pp. 77-96, 1985). ELISA may be used to determine if hybridoma cells are expressing 

anti-NPCl LI antibodies. 

The anti-NPCl LI antibody molecules of the present invention may 
also be produced recombinantly (e.g., in an Rcoli/T7 expression system as discussed 
above). In this embodiment, nucleic acids encoding the antibody molecules of the 
invention (e.g., Vh or Vl) may be inserted into a pet-based plasmid and expressed in 
the E.colilTl system. There are several methods by which to produce recombinant 
antibodies which are known in the art. An example of a method for recombinant 
production of antibodies is disclosed in U.S. Patent No. 4,816,567. See also Skerra, 
A, etaL, (1988) Science 240: 1038-1041; Better, M., et aL, (1988) Science 240: 
1041-1043 and Bird, R.E., et aL 9 (1988) Science 242: 423-426. 

The term "monoclonal antibody," includes an antibody obtained from a 
population of substantially homogeneous antibodies, i.e., the individual antibodies 
comprising the population are identical except for possible, naturally occurring 
mutations that may be present in minor amounts. Monoclonal antibodies are highly 
specific, being directed against a single antigenic site. Monoclonal antibodies are 
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advantageous in that they may be synthesized by a hybridoma culture, essentially 
uncontaminated by other iinmunoglobulins. The modifier "monoclonal" indicates the 
character of the antibody as being obtained from a substantially homogeneous 
population of antibodies, and is not to be construed as requiring production of the 
5 antibody by any particular method. The monoclonal antibodies to be used in 
accordance with the present invention may be made by the hybridoma method as 
described by Kohler, et al 9 (1975) Nature 256: 495. 

The term "polyclonal antibody* includes an antibody which was 
produced among or in the presence of one or more other, non-identical antibodies. In 

10 general, polyclonal antibodies are produced from a B-Iymphocyte in the presence of 
several other B-lymphocytes which produced non-identical antibodies. Typically, 
polyclonal antibodies are obtained directly from an immunized animal (e.g. , a rabbit). 

A "bispecific antibody" comprises two different antigen binding 
regions which bind to distinct antigens. Bispecific antibodies, as well as methods of 

1 5 making and using the antibodies, are conventional and very well known in the art 

Anti-idiotypic antibodies or anti-idiotypes are antibodies directed 
against the antigen-combining region or variable region (called the idiotype) of 
another antibody molecule. As disclosed by Jerne (Jerne, N. K., (1974) Ann. 
Immunol. (Paris) 125c: 373 and Jerne, N. KL, et aL, (1982) EMBO 1: 234), 

20 immunization with an antibody molecule expressing a paratope (antigen-combining 
site) for a given antigen (e.g., NPC1L1) will produce a group of anti-antibodies, some 
of which share, with the antigen, a complementary structure to the paratope. 
Immunization with a subpopulation of the anti-idiotypic antibodies will, in turn, 
produce a subpopulation of antibodies or immune cell subsets that are reactive to the 

25 initial antigen. 

The term "fully human antibody" refers to an antibody which 
comprises human immunoglobulin sequences only. Similarly, "mouse antibody" 
refers to an antibody which comprises mouse immunoglobulin sequences only. 

"Human/mouse chimeric antibody" refers to an antibody which 
30 comprises a mouse variable region (Vh and Vt) fused to a human constant region. 

"Humanized" anti-NPClLl antibodies are also within the scope of the 
present invention. Humanized forms of non-human (e.g., murine) antibodies are 
chimeric immunoglobulins, which contain minimal sequence derived from non- 
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human immunoglobulin. For the most part, humanized antibodies are human 
immunoglobulins (recipient antibody) in which residues from a complementary 
determining region of the recipient are replaced by residues from a complementary 
determining region of a nonhuman species (donor antibody), such as mouse, rat or 
5 rabbit, having a desired specificity, affinity and capacity- In some instances, Fv 

framework residues of the human immunoglobulin are also replaced by corresponding 

non-human residues. 

"Single-chain Fv" or "sFv" antibody fragments include the V H and/or 
Vl domains of an antibody, wherein these domains are present in a single polypeptide 

1 0 chain. Generally, the sFv polypeptide further comprises a polypeptide linker between 
the Vh and Vl domains which, enables the sFv to form the desired structure for 
antigen binding. Techniques described for the production of single chain antibodies 
(U.S. Patent Nos. 5,476,786; 5,132,405 and 4,946,778) can be adapted to produce 
anti-NPClLl specific, single chain antibodies. For a review of sFv see Pluckthun in 

15 The Pharmacology of Monoclonal Antibodies, vol. 113, Rosenburg and Moore, eds., 
Springer-Verlag, N. Y., pp. 269-3 1 5 (1 994). 

"Disulfide stabilized Fv fragments'* and "dsFv" include molecules 
having a variable heavy chain (Vh) and/or a variable light chain (VO which are linked 
by a disulfide bridge. 

20 Antibody fragments within the scope of the present invention also 

include F(ab)2 fragments which may be produced by enzymatic cleavage of an IgG 
by, for example, pepsin. Fab fragments may be produced by, for example, reduction 
ofF(ab)2 with dithiothreitol or mercaptoethylamine. 

An FV fragment is a Vl or Vh region. 

25 Depending on the amino acid sequences of the constant domain of 

their heavy chains, immunoglobulins can be assigned to different classes. There are at 
least five major classes of immxmoglobulins: IgA, IgD, IgE, IgG and IgM, and several 
of these maybe further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3 
and IgG-4; IgA-1 and IgA-2. 

30 The anti-NPCl Ll antibody molecules of the invention may also be 

conjugated to a chemical moiety. The chemical moiety may be, inter alia, a polymer, 
a radionuclide or a cytotoxic factor. Preferably, the chemical moiety is a polymer 
which increases the half-life of the antibody molecule in the body of a subject 
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Suitable polymers include, but are by no means limited to, polyethylene glycol (PEG) 
(e.g., PEG with a molecular weight of 2kDa, 5kDa, lOkDa, 12kDa, 20kDa, 30fcDa or 
40kDa), dextran and monomethoxypolyethylene glycol (mPEG). Methods for 
producing PEGylated anti-IL8 antibodies which are described in U.S. Patent No. 
5 6,133,426 can be applied to the production of PEGylated anti-NPClLl antibodies of 

■ 

the invention. Lee, etaL, (1999) (Bioconj. Chem. 10: 973-981) discloses PEG 
conjugated single-chain antibodies. Wen, etaL, (2001) (Bioconj. Chem. 12: 545-553) 
discloses conjugating antibodies with PEG which is attached to a radiometal chelator 
(diethylenetriaminpentaacetic acid (DTP A)). 
10 The antibody molecules of the invention may also be conjugated with 

labels such as "Tc, *Y, '"in, 32 P, 14 C, 125 1, 3 H, ,31 I, n C, ls O, l3 N, ,8 F, 35 S, 51 Cr, ^To, 
^a, ^Co, 59 Fe, 57 Se, l52 Eu, 67 CU, 2l7 Ci, 211 At, 212 Pb, 47 Sc, I09 Pd, 234 Th, 157 Gd, 
^Mn^ror^TFe. 

The antibody molecules of the invention may also be conjugated with 
1 5 fluorescent or chemilluminescent labels, including fluorophores such as rare earth 

chelates, fluorescein and its derivatives, rhodamine and its derivatives, isothiocyanate, 
phycoerythrin, phycocyanin, allophycocyanin, o-phthaladehyde, fluorescamine, Eu, 
dansyl, umbelliferone, luciferin, luminal label, isoluminal label, an aromatic 
acridinium ester label, an imidazole label, an acridimium salt label, an oxalate ester 
20 label, an aequorin label, 2,3-dihydrophthalazinediones, biotin/avidin, spin labels and 
stable free radicals. 

The antibody molecules may also be conjugated to a cytotoxic factor 
such as diptheria toxin, Pseudomonas aeruginosa exotoxin A chain, ricin A chain, 
abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins and 
25 compounds (e.£., fatty acids), dianthin proteins, Phytolacca americana proteins PAPI, 
PAPII, and PAP-S, momordica charantia inhibitor, curcin, crotin, saponaria 
officinalis inhibitor, mitogellin, restrictocin, phenomycin, and enomycin. 

Any method known in the art for conjugating the antibody molecules 
of the invention to the various moieties may be employed, including those methods 
30 described by Hunter, et aL, (1962) Nature 144: 945; David, et al., (1974) 

Biochemistry 13: 1014; Pain, et aL, (1981) J. Immunol. Meth. 40: 219; and Nygren, 
J., (1982) Histochem. and Cytochem. 30: 407. 
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Methods for conjugating antibodies are conventional and very well 
known in the art. 

Screening Assays 
The invention allows the identification of selective ligands of NPC1L1 
5 (e.g. y SEQ ID NO: 2, 4 or 12) that may be useful in treatment and management of a 
variety of medical conditions, including elevated serum sterol (e.g., cholesterol) or 
5a-stanol. Thus, NPC1L1 of this invention can be employed in screening systems to 
identify ligands. These ligands may be agonists or antagonists of NPC1L1 . 
Essentially, these assays provide methods for identifying ligands of NPC1L1 by using 

10 (1) NPC1L1, (2) an appropriate known NPC1L1 ligand, agonist or antagonist, for 
example, a sterol (such as cholesterol, phytosterols, including, but not limited to, 
sitosterol, campesterol, stigmasterol and avenosterol), a cholesterol oxidation product, 
a 5a-stanol (including, but not limited to, cholestanol, 5a-campestanol and 5a- 
sitostanol), a substituted azetidinone (e.g., ezetimibe), BODIPY-ezetimibe (Altmann, 

15 et al, (2002) Biochim. Biophys. Acta 1 580(1): 77-93) or 4", 6"-bis[(2- 

fluorophenyl)carbamoyl]-beta-D-cellobiosyl derivative of 1 1-ketotigogenin as 
described in DeNinno, et a/., (1997) (J. Med. Chem. 40(16): 2547-54) or any 
substituted azetidinone, and (3) a sample to be tested for the presence of a candidate 
NPC1L1 ligand. 

20 The term "specific" when used to describe binding of, for example, a 

ligand of NPC1L1 in a screening assay is a term of art which refers to the extent by 
which the ligand or antagonist (e.g. , substituted azetidinone, ezetimibe, sterol (such as 
cholesterol) or 5<x-stanol) binds preferentially to NPC1L1 in comparison to other 
proteins in the assay system. For example, detection of the specific binding of a 

25 ligand of NPC1L1 binds specifically to NPC1L1 is made apparent when a signal 
generated in the assay to indicate such binding exceeds, to any extent, a signal 
generated in a negative control wherein, for example, NPC1L1 or ligand is absent 
Furthermore, "specific binding" includes binding of a ligand either directly to 
NPC1L1 or indirectly, for example via another moiety, in a complex of which 

30 NPC1L1 is a part. The moiety to which an NPC1L1 ligand binds can be another 
protein or a post-translational modification of NPC1L1 (e.g., a lipid chain or a 
carbohydrate chain). 
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Non-limiting examples of suitable substituted azetidinones for use in 
the screening assays include (hose disclosed in U.S. Patent Nos. RE37,721 ; 
5,631,365; 5,767,115; 5,846,966; 5,688,990; 5,656,624; 5,624,920; 5,698,548; 
5,756,470; 5,688,787; 5,306,817; 5,633,246; 5,627,176; 5,688,785; 5,744,467; 
5 5,846,966; 5,728,827; 6,632,933 and U.S. Pateaat Application Publication No 
2003/01 05028-each of which is herein incorporated by reference in its entirety. 

The present invention provides for a method by which to evaluate 
whether a sample contains an NPC1L1 ligand by determining whether the sample 
contains a candidate compound which competes for binding between the known 
10 ligand (e.^., ezetimibe, ezetimibe-glucuronide, compound 2, etc.) andNPClLl. The 
ligand may be an agonist or antagonist. In an embodiment of the invention, the 
binding of the known ligand (e.g., ezetimibe, ezetimibe-glucuronide, compound 2, 
etc.) to NPC1L1 is disrupted. The term "known ligand" refers to a compound which 
is known to bind to NPC1L1 and which can be detectably labeled for use in the 
15 screening assays and methods described herein.. "Known ligands" include the 
substituted 2-azetidinone glucuronic! es which can be detectably labeled for use in 
screening assays as described herein. 

Ezetimibe can be prepared by a variety of methods well know to those 
skilled in the art, for example such as are disclosed in U.S. Patents Nos. 5,631,365, 
20 5,767,1 1 5, 5,846,966, 6,207,822, U.S. Patent Application Publication No. 
2002/0193607 and PCT Patent Application WO 93/02048, each of which is 
incorporated herein by reference in its entirety. 

"Sample*', "candidate compound" or "candidate substance" refers to a 
compound or composition which is evaluated in a test or assay, for example, for the 
25 ability to bind to NPC1L1 (e.g. 9 SEQ ID NO: 2, 4 or 12) or a functional fragment 
thereof. The composition may comprise candidate compounds, such as small 
molecules, peptides, nucleotides, polynucleotides, subatomic particles (e.g., a 
particles, P particles) or antibodies. 

The present invention provides methods for identifying ligands of a 
30 compound that binds to NPC1L1 which involve contacting NPC1L1 with a detectably 
labeled substituted 2-azetidinone, preferably substituted 2-azetidmone-glucuronide, 
and a candidate compound, and determining whether the candidate compound binds 
to NPC1L1, wherein binding of said candidate compound to NPC1L1 modulates 
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binding of the detectably labeled substituted 2-azetidinone to NPC1 LI . The 
modulation of the binding of the substituted 2-azetidinone to NPC1L1 by the binding 
of the candidate compound to NPC1L1 indicates that the candidate compound is a 
ligand that binds to NPC1L1 . It is also a good indication that the candidate compound 
may be an inhibitor of sterol and 5a-stanol absorption in vivo. 

The present invention also provides a method for identifying a ligand 
of NPC1L1 comprising contacting NPC1L1 with a detectably labeled substituted 2- 
azetidinone, preferably substituted 2-azetidinone-glucuronide, and measuring the 
binding of NPC1L1 of the detectably labeled substituted 2-azetidinone in the presence 
and absence of a candidate compound, wherein decreased binding of the detectably 
labeled substituted 2-azetidinone to the NPC1L1 in the presence of the candidate 
compound indicates that said candidate compound is a ligand of NPC1L1 and is an 
inhibitor of sterol and 5a-stanol absorption. 

The substituted 2-azetidinone is detectably labeled with 3 H, 35 S, 125 I, or 
a fluorescently labeled substituted 2-azetidinone. Preferably, the substituted 
2-azetidinone is labeled with 35 S or 125 I, and particularly 35 S. 

Preferably, the substituted 2-azetidinone is substituted 2-azetidinone- 
glucuronide. Compounds that are substituted 2-azetidinone-glucuronides are those 
having the following structure (I): 

o H 




wherein X 1 represents a group that links the glucuronide to the 
4-phenyl ring, for example but not limited to -O- or ~C U3 alkyl-, X 2 represents an 
optionally substituted -alkanediyl-, and wherein any of the phenyl groups may be 
optionally substituted. Examples of the phenyl-X 2 -moiety in structure (I) include 
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those represented at the 4-position on the 2-azetidinone structure shown below in 
structure (II). Additional examples of substituted 2-azetidinone-glucuronides include 
but are not limited to those described in U.S. Patent No. 5,756,470, WO02/066464 
and US 2002/0137689. Additional examples of substituted 2 -azetidinone-glucuronide 
compounds include those having the structure (II) and phannaceutically acceptable 
salts and esters thereof as follows: 



Ar 1 (X) m (C) q (C)r <A> 

R 1 R 3 




(n) 

wherein: 

Arl is selected from the group consisting of aryl and R 4 -substituted aryl; 

X, Y and Z are independently selected from the group consisting of -CH2-, -CH(Ci- 

6alkyl)- and -C(Cl-6alkyl)2S 
R is selected from the group consisting of -OR6 ~0(CO)R 6 , -0(CO)OR 9 , 
-CKCCtyNR^ 7 , a sugar residue, a disugar residue, a trisugar residue and a 
tetrasugar residue; 

Rl is selected from the group consisting of -H, -Ci-6alkyl and aryl, or R and Rl 
together are oxo; 

R2 is selected from the group consisting of -OR6, -0(CO)R 6 , -0(CO)OR 9 and 
-0(CO)NR 6 R 7 ; 

R3 is selected from the group consisting of -H, -Ci-6alkyl and aryl or R2 and R3 

4 

together are oxo; 
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q, r and t are each independently selected from 0 and 1 ; 
m, n and p are each independently selected from 0, l a 2, 3 and 4; 
R4 is 1-5 substituents independently selected at each occurrence from the group 
consisting of: 

-OR5, -0(CO)R5, -0(CO)OR8, -O-Ci-salkyl-OR*, -0(CO)NR5r6, -NR5R6, - 
^5(00^6, -NR5(CO)OR8, -NR5(CO)NR6r7, -NR5S02R 8 , -COOR5, - 
CONR5R6, -COR5, 

-S02NR5R6, -S(0) t R8, -O-CM0alkyl-COOR5, -0-Ci-ioalkyl-CONR5R6 and 
fluoro; 

R5, r6 and R 7 are independently selected at each occurrence from the group 
consisting of -H, Ci-6alkyl, aryl and aryl-substituted Ci-^alkyl; 

R8 is independently selected from the group consisting of Ci-galkyl, aryl and aryl- 
substituted Ci-galkyI; 

R9 is selected from the group consisting of -C^C-CH2-NRlORl 1, -OC-C(0)Rl3, 
and-(CH2)3-NRlORl4 ; 

RlO is independently selected at each occurrence from -H and — Ci-3alkyl; 

Rl 1 is selected from the group consisting of-H, - Ci-3alkyl, -C(0)-Ci-3alkyl, - 
CCCO-NRlORlO, -S02-Ci-3alkyl, and -S02-phenyl; and 

Rl2 is selected from 




CO2H and COaMe 

(referred to herein as "gjucuronide") (referred to herein as "methyl ester 
glucuronide"); 

Rl3 is selected from the group consisting of -OH and-NRlOR 1 *; and 
Rl4 is selected from the group consisting of -C(6)-Ci-3alkyl, -C(0>NR10R 10 , 
-S02-Ci-3alkyl and -S02-phenyl. 

In one embodiment of Formula II are compounds wherein q, r and t are 
each independently selected from 0 and 1 ; and m, n and p are each independently 
selected from 0, 1, 2, 3 and 4; provided that at least one of q and r is 1, and the sum of 
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m, n, p, q are r is 1, 2, 3, 4, 5 or 6; and provided that when p is 0 and r is 1, the sum of 
m, q and n is 1, 2, 3, 4, or 5. In a second embodiment of Formula II are compounds of 
Formula Ha, 




In a class of each of these embodiments are compounds wherein R9 is -O^C-CH2- 
NRlORl 1 . in another class of each of these embodiments are compounds wherein R 9 
contains an -SO2- group, i.e., wherein R9 is selected from the group consisting of 
_>C^C-CH 2 -NRlORl 1, -OsC-C(O) NRlORll, -(CH 2 )3-NRl0-SO2-Ci.3alkyl and 

-(CH2)3-NRl0-sO2-phenyl, and Rl 1 is selected from -S02-Ci-3alkyi, and -SO2- 
phenyl. 

The term "alkyl" is intended to include both branched- and straight- 
chain saturated aliphatic univalent hydrocarbon groups having the specified number 
of carbon atoms. Examples of alkyl groups include, but are not limited to, methyl 
(Me), ethyl (Et), n-propyl (Pr), n-butyi (Bu), n-pentyl, n-hexyl, and the isomers 
thereof such as isopropyl (i-Pr), isobutyl (i-Bu), secbutyl (s-Bu), tertbutyl (t-Bu), 
isopentyi, isohexyi and the like. If there is no specified prefix (such as "n-" for 
normal, "s-" for sec, "t-" for tert, "i-" for iso) with a named alkyl group, then it is 
intended that the named alkyl group is an n-alkyl group (i.e., "propyl" is "n-propyl"). 
The term "aryP* is intended to include phenyl (Ph), naphthyl, indenyl, 
tetrahydronaphthyl or indanyl. Phenyl is preferred. 

Suitable protecting groups (designated as "PG" herein) for the 
hydroxyl groups of Rl2 when Rl2 is a glucuronide or methyl ester glucuronide 
include but are not limited to those that are known to be useful as carbohydrate 
protecting groups, such as for example benzyl, acetyl, benzoyl, tert- 
butyldiphenylsilyl, trimethylsilyl, /rara-methoxybenzyl, benzylidine, and metJioxy 
methyl. Conditions required to selectively add and remove such protecting groups are 
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found in standard textbooks such as Greene, T, and Wuts, P. G. M., Protective 
Groups in Organic Synthesis, John Wiley & Sons, Inc., New York, NY, 1999. 

Compounds of Formula II may contain one or more asymmetric 
centers and can thus occur as racemates and racemic mixtures, single enantiomers, 
5 enantiomeric mixtures, diastereomeric mixtures and individual diastereomers, and all 
such isomeric forms are within the scope of Formula II. 

Radioactive isotopes of the compounds of Formula II are particularly 

9 useful in such assays, for example compounds of Formula EL wherein sulfur is 
replaced with "hot" -35S-, and particularly wherein the radioactive sulfur isotope is 

1 0 incorporated within the R$ moiety. The use of all such radioactive isotopes of the 
compounds of Formula II in an assay for identifying NPC1L1 ligands is included 

within the scope of this invention. 

The term "pharmaceutically acceptable salts" means non-toxic salts of 
the compounds of Formula II which are generally prepared by reacting the free acid 

15 with a suitable organic or inorganic base, particularly those formed from cations such 
as sodium, potassium, aluminum, calcium, lithium, magnesium, zinc and 
tetramethylammonium, as well as those salts formed from amines such as ammonia, 
ethylenediamine, N-methylglucamine, lysine, arginine, ornithine, choline, 
N,N'-dibenzylethyienediamine, chloroprocaine, diethanolamine, procaine, 

20 N-benzylphenethylamine, 1 -p-chlorobenzyl-2-pyrrolidine-l '-yl-methylbenzimidazole, 
diethylamine, piperazine, morpholine, 2,4,4-trimethyl-2-pentamine and 
tris(hydroxymethyl)aminomethane. 

When the compounds of Formula II are basic, salts may be prepared 
from pharmaceutically acceptable non-toxic acids, including inorganic and organic 

25 acids. Such acids include acetic, benzenesulfonic, benzoic, camphorsulfonic, citric, 
ethanesulfonic, fumaric, gluconic, glutamic, hydrobromic, hydrochloric, isethionic, 
lactic, maleic, malic, mandelic, methanesulfonic, mucic, nitric, pamoic, pantothenic, 
phosphoric, succinic, sulfuric, tartaric, p-toluenesulfonic acid, and the like. 
Particularly preferred are citric, hydrobromic, hydrochloric, maleic, phosphoric, 

30 sulfuric, and tartaric acids. 

Examples of pharmaceutically acceptable esters include, but are not 
limited to, -Ci-4 alkyl and -Cl-4 alkyl substituted with phenyl, dimethylamino and 
acetylamino. "Ci^ alkyl" herein includes straight or branched aliphatic chains 
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containing from 1 to 4 carbon atoms, for example methyl, ethyl, n-propyi, n-butyi, 
iso-propyl, sec-butyl and tert-butyl. 

The compounds of structural Formula II can be prepared according to 
the procedures of the following Scheme using appropriate materials, and are further 
5 exemplified by specific examples which follow. A variety of chromatographic 
techniques may be employed in the preparation of the compounds of Formula II. 
These techniques include, but are not limited to: Higji Performance Liquid 
Chromatography (including normal- reversed- and chiral-phase); Super Critical Fluid 
Chromatography; preparative Thin Layer Chromatography; flash chromatography 
10 with silica gel or reversed-phase silica gel; ion-exchange chromatography; and radial 
chromatography. All temperatures are degrees Celsius unless otherwise noted. 

Some abbreviations used herein include: 



Ac Acyl (CH 3 C(0)-) 

Bn benzyl 

calc. Calculated 

Celite CeliteTM diatomaceous earth 
Dess-Martin Periodinane l,l,l-tris(acetyloxy)-l,l-dihydro-l^-benzodoxol-3- 

(lii)-one 

DMF N^-dimethylformamide 

equiv. Equivalents) 

ES-MS Electron Spray Ion-Mass Spectroscopy 

EtOAc Ethyl acetate 

h Hour(s) 

HPLC High performance liquid chromatography 

min Minute(s) 

m.p. Melting point 

MS Mass spectrum 

r.t (or rt) Room temperature 

TFA Trifluoroacetic acid 

THF Tetrahydrofuran 

Tic Thin layer chromatography 



The general Scheme below illustrates a method for the syntheses of 
1 5 compounds of structural formula II-4. All substituents are as defined in Formula II 
unless indicated otherwise. In this method, H-l is treated with a terminal alkyne of 
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type n-2 in the presence of a suitable palladium catalyst such as 
tetralristriphenylphosphine palladium(O) or [ 1,1 *- 

bis(diphenylphosphino)ferrocene]dichloropaUadium(II) or the like, and copper© 
iodide. The reaction is usually performed in an inert organic solvent such as DMF, 
5 between room temperature and 100 °C, for a period of 6-48 h> -and the product is an 
internal alkyne of structural formula 11-3. Alkyne 11-2 may contain a radioactive atom 
such as 35S to provide the corresponding radiolabeled adduct upon reaction with ET-1 . 
Conversion of II-3 to II-4 can be achieved using a variety of hydrolytic methods 
known to those skilled in the art of organic synthesis. For example, a particularly 
1 0 mild hydrolysis protocol involves the treatment of II-3 with a tertiary amine base such 
as triethylamine, or diisopropylethylamine or the like, in a mixed solvent system 
comprising methanol and water. The product of the reaction is a compound of 
structural formula II-4. By utilizing the procedures described therein, one of ordinary 
skill in the art can readily prepare additional compounds of Fonmual II. 

15 SCHEME 



HO OH 
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Two additional types of screening systems that can be used include a 
labeled-ligand binding assay {e.g., direct binding assay or scintillation proximity 
assay (SPA)) and a "sterol (e.g., cholesterol) or 5a-stanol uptake" assay. A labeled 
ligand, for use in the binding assay, can be obtained by labeling a sterol (e.g.» 
5 cholesterol) or a 5a-stanol or a known NPC1L1 agonist or antagonist with a 
measurable group (eg., 35 S, 125 I or 3 H). Various labeled forms of sterols (e.g. 9 
cholesterol) or 5a-stanois are available commercially or can be generated using 
standard techniques (e.g., Cholesterol- [1> 3 H(N)], Cholesterol-[l ,2,6,7- 3 H(N)] or 
Cholesterol-[7- 3 H(N)]; American Radiolabeled Chemicals, inc; St. Louis, MO). In a 
10 preferred embodiment, ezetimibe is fluorescently labeled with a BODIPY group 
(Altmann, et al 9 (2002) Biochim. Biophys. Acta 1580(1): 77-93) or labeled with a 
detectable group such as 35 S, ,25 I or *H, preferably 35 S. 

Direct Binding Assay. Typically, a given amount of NPC1L1 of the 
invention (e.g., SEQ ID NO: 2, 4 or 12) or a complex including NPC1L1 is contacted 
15 with increasing amounts of labeled ligand or known antagonist or agonist (discussed 
above) and the amount of the bound, labeled ligand or known antagonist or agonist is 
measured after removing unbound, labeled ligand or known antagonist or agonist by 
washing. As the amount of the labeled ligand or known agonist or antagonist is 
increased, a point is eventually reached at which all receptor binding sites are 
20 occupied or saturated. Specific receptor binding of the labeled ligand or known 
agonist or antagonist is abolished by a large excess of unlabeled ligand or known 
agonist or antagonist 

Preferably, an assay system is used in which non-specific binding of 
the labeled ligand or known antagonist or agonist to the receptor is minimal. Non- 
25 specific binding is typically less than 50%, preferably less than 15%, more preferably 
less than 1 0%, and most preferably 5% or less, of the total binding of the labeled 
ligand or known antagonist or agonist 

In the basic binding assay, the method for identifying an NPC1 LI 
ligand, agonist or antagonist includes: 
30 (a) contacting NPC1L1 (e.g, SEQ ID NO: 2 or 4 or 12), a 

fragment thereof or a complex including NPC1L1, in the presence of a known amount 
of labeled sterol (e.g., cholesterol) or 5a-stanol or known antagonist or agonist (e.g., 
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labeled ezetimibe) with a sample to be tested for the presence of an NPC1L1 ligand, 

agonist or antagonist; and 

(b) measuring the amount of labeled sterol (e.g. 9 cholesterol) or 5a- 
stanol or known antagonist or agonist directly or indirectly bound to NPC1L1. 
5 An NPC1L1 ligand in the sample is identified by measuring 

substantially reduced direct or indirect binding of the labeled sterol (e.g. 9 cholesterol) 
or 5a-stanol or known antagonist or agonist to NPC1L1, compared to what would be 
measured in the absence of such a ligand. For example, reduced direct or indirect 
binding between [ 3 H]-cholesterol and NPC1L1 in the presence of a sample might 

3 

10 suggest that the sample contains a substance which is competing against [ HQ- 
cholesterol forNPClLl binding. 

This assay can include a control experiment lacking any NPC1L1 - 
dependent ligand (e.g., sterol such as cholesterol or 5a-stanol) binding. In this assay, 
for example, a whole cell or cell membrane lacking any functional NPC1L1, for 

1 5 example, a cell or membrane isolated or derived from a transgenic mutant npcl U~ 
mouse of the invention, is assayed for ligand binding. When screening a sample for 
the presence of an NPC1L1 antagonist, it is useful to compare the level of binding 
observed in the presence of a sample being tested with that of a control experiment, as 
described herein, which completely lacks NPC1L1 -dependent binding. Ideally, 

20 though by no means necessarily, the level of binding seen in the presence of a sample 
containing an antagonist will be similar to that of the control experiment. 

Alternatively, a sample can be tested directly for binding to NPC1L1 
(e.g., SEQ ID NO: 2, 4 or 12). A basic assay of this type may include the following 
steps: 

25 (a) contacting NPC1L1 {e.g., SEQ ID NO: 2 or 4 or 12), a 

fragment thereof or a complex including NPC1L1 with a labeled candidate compound 

(e.g., [ 3 H]~ezetimibe); and 

(b) detecting direct or indirect binding between the labeled 

candidate compound and NPC1L1. 
30 Again, these experiment can be performed along with a control 

experiment wherein NPC1 Ll-dependent binding is completely lacking. For example, 
the assay can be performed using a whole cell or cell membrane lacking any 
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functional NPC1L1 (e.g. 9 cell or cell membrane derived from a transgenic, mutant 
npclir mouse as described herein). 

A candidate compound which is found to bind to NPC1L1 may 
function as ligand, agonist or antagonist of NPC1L1 (e.g., by inhibition of sterol 
(e.g., cholesterol) or 5a-stanol uptake). 

In an embodiment of the invention, the bound candidate compound is 
quantified after filtration using glass fiber filters. In one aspect of this embodiment, 
the bound candidate compound is detected after single-tube vacuum filtration of GF/C 
glass fiber filters, obtained from Whatman. The filters may be pretreated by soaking 
with 0.5% polyethylenimine to reduce nonspecific binding. Filtration is accomplished 
by adding ice cold buffer to the assay tube, pouring the mixture through the filter, and 
then rinsing the tube and filter twice more with additional buffer. The buffer may be 
a Tris buffer or MES buffer (120 mM NaCl, 0.1% sodium cholate, and 20 mM MES 
at pH 6.70). The filters can be counted using scintillation fluid, e.g., Packard DM 
liquid or Packard Ultima Gold MV. 

Alternatively, vacuum filtration of the sample on a Milliore 96-well 
plate (Whatman GF/C) can also be used to achieve adequate precision in a manner 
well-known to those dolled in the art 

SPA Assay. NPC1L1 ligands may also be measured using scintillation 
proximity assays (SPA). SPA assays are conventional and very well known in the art; 
see, for example, U.S. Patent No. 4,568,649. In SPA, the target of interest is 
immobilized to a small microsphere approximately 5 microns in diameter. The 
microsphere, typically, includes a solid scintillant core which has been coated with a 
polyhydroxy film, which in turn contains coupling molecules, which allow generic 
links for assay design. When a radioisotopically labeled molecule binds to the 
microsphere, the radioisotope is brought into close proximity to the scintillant and 
effective energy transfer from electrons emitted by the isotope will take place 
resulting in the emission of light. While the radioisotope remains in free solution, it is 
too distant from the scintillant and the electron will dissipate the energy into the 
aqueous medium and therefore remain undetected. Scintillation may be detected with 
a scintillation counter. In general, 3 H, 125 I and 35 S labels are well suited to SPA. 

For the assay of receptor-mediated binding events, the lectin wheat 
germ agglutinin (WGA) may be used as the SPA bead coupling molecule ( Amersham 
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Biosciences; Piscataway, NJ). The WGA coupled bead captures glycosylated, 
cellular membranes and glycoproteins and has been used for a wide variety of 
receptor sources and cultured cell membranes. The receptor is immobilized onto the 
WGA-SPA bead and a signal is generated on binding of an isotopically labeled 
5 ligand. Other coupling molecules which may be useful for receptor binding SPA 
assays include poly-L-lysine and WGA/polyethyleneimine (Amersham Biosciences; 
Piscataway, NJ). See, for example, Berry, J.A., et a/., (1991) Cardiovascular 
Pharmacol. 17 (Suppl.7): S143-S145; Hoffinan, R., et al 9 (1992) Anal. Biochem. 203: 
70-75; Kienhus, et al. 9 (1992) J. Receptor Research 12: 389-399; Jing, S., et al % 

10 (1992) Neuron 9: 1067-1079. 

The scintillant contained in SPA beads may include, for example, 
yttrium silicate (YSi), yttrium oxide (YOx), diphenyloxazole or polyvinyltoluene 
(PVT) which acts as a solid solvent for diphenylanthracine (DPA). 

SPA assays may be used to analyze whether a sample contains an 

15 NPC1L1 ligand. In these assays, a host cell which expresses NPC1L1 {e.g., SEQ ID 
NO: 2 or 4 or 12) on the cell surface or a membrane fraction thereof is incubated with 
and captured by SPA beads (eg., WGA coated YOx beads or WGA coated YSi 
beads). The beads bearing the NPC1 LI are incubated with labeled, known ligand or 
agonist or antagonist {e.g., 3 H-cholesterol, 3 H-ezetimibe, 125 I-ezetimibe or a 3S S- 

20 ezetimibe analog). The assay mixture further includes either the sample to be tested 
or a blank {e.g., water). After an optional incubation, scintillation is measured using a 
scintillation counter. An NPC1L1 ligand, agonist or antagonist may be identified in 
the sample by measuring substantially reduced fluorescence, compared to what would 
be measured in the absence of such ligand, agonist or antagonist (blank). Measuring 

25 substantially reduced fluorescence may suggest that the sample contains a substance 
which competes for direct or indirect NPC1L1 binding with the known ligand, agonist 
or antagonist. 

Alternatively, a sample may be identified as an ligand of NPC1L1 by 
directly detecting binding in a SPA assay. In this assay, a labeled version of a 
30 candidate compound to be tested may be put in contact with the host cell expressing 
NPC1L1 or a membrane fraction thereof which is bound to the SPA bead. 
Fluorescence may then be assayed to detect the presence of a complex between the 
labeled candidate compound and the host cell or membrane fraction expressing 



-42- 



WO 2005/069900 



PCT/US2005/001469 



NPC1L1 or a complex including NPC1L1 . A candidate compound which binds 
directly or indirectly to NPC1L1 may possess NPC1 LI agonistic or antagonistic 
activity. 

SPA Assays can also be performed along with a control experiment 
5 lacking any NPC1 LI -dependent binding. The control experiment can be performed, 
for example, with a cell or cell membrane lacking any functional NPC1L1 (e.g, cell 
or cell membrane derived from a transgenic, mutant npclll- mouse as described 
herein). When the control experiment is performed, the level of binding observed in 
the presence of sample being tested for the presence of an antagonist can be compared 
1 0 with that observed in the control experiment. 

SteroUSa-stanol Uptake Assay. Assays may also be performed to 
determine if a sample can agonize or antagonize NPC1L1 mediated sterol (e.g., 
cholesterol) or 5a-stanol uptake. In these assays, a host cell expressing NPC1L1 
(e.g. y SEQ ID NO: 2 or 4 or 12) on the cell surface (discussed above) can be contacted 
1 5 with detectably labeled sterol (e.g., 3 H-cholesterol or l25 I-cholesterol)) or 5a-stanol 
along with either a sample or a blank. After an optional incubation, the cells can be 
washed to remove unabsorbed sterol or 5a-stanol. Sterol or 5a-stanol uptake can be 
determined by detecting the presence of labeled sterol or 5a-stanol in the host cells. 
For example, assayed cells or lysates or fractions thereof (e.g., fractions resolved by 
20 thin-layer chromatography) can be contacted with a liquid scintillant and scintillation 
can be measured using a scintillation counter. 

In these assays, an NPC1L1 antagonist in the sample may be identified 
by measuring substantially reduced uptake of labeled sterol (e.g. 9 3 H-cholesterol) or 
5a-stanol, compared to what would be measured in the absence of such an antagonist 
25 and an agonist may be identified by measuring substantially increased uptake of 
labeled sterol (e.g., 3 H-cholesterol) or Sa-stanol, compared to what would be 
measured in the absence of such an agonist. 

Uptake assays can also be performed along with a control experiment 
lacking any NPClLl-dependent uptake. The control experiment can be performed, 
30 for example, with a cell lacking any functional NPC1L1 (e.g., cell derived from a 

transgenic, mutant npclll' mouse as described herein). When the control experiment 
is performed, the level of uptake observed in the presence of sample being tested for 
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the presence of an antagonist can be compared with that observed in the control 
experiment 

Source ofNPClLl. In principle, a binding assay of the invention 
could be carried out using a soluble NPC1L1 polypeptide of the invention, e.g. 9 
5 following production and refolding by standard methods from an E. coli or other 

prokaryotic or eukaryotic expression system, and the resulting receptor-labeled ligand 
complex could be precipitated, e.g., using an antibody against the receptor. The 
precipitate could then be washed and the amount of the bound, labeled ligand or 
antagonist or agonist could be measured. 

10 Alternatively, NPC1L1 is membrane-bound. A nucleic acid encoding 

an NPC1L1 polypeptide of the invention (e.g. 9 SEQ ID NO: 2, 4 or 12) can be 
trans fected into an appropriate host cell, whereby the NPC1L1 will become 
incorporated into the membrane of the cell. A membrane fraction can then be isolated 
from the cell and used as a source of NPC1L1 for assay. Alternatively, the whole cell 

15 expressing NPC1L1 in the cell surface can be used in an assay. Preferably, specific 
binding of the labeled ligand or known antagonist or agonist to an 
untransfected/untransformed host cell or to a membrane fraction from an 
untransfectedAmtransformed host cell will be negligible. 

Various membranes may be used directly as a source of NPC 1 LI for 

20 the above-described screening systems, e.g. direct binding, scintillation proximity 
assay, sterol/5a stanol uptake assay. As described in Examples 5, 6, 7, 8, 9, 17, 27, 
and 29, NPC 1 LI is highly expressed in certain tissues, especially in brush border cells 
of intestinal tissues. Therefore, brush border membrane (BBM) vesicle preparations 
may be utilized as a source of NPC1L1 . The membranes may be derived from 

25 mammalian intestinal tissue from rhesus, rat, mouse or human tissue. 

Membranes may be derived from brush bonier cells of intestinal 
tissues. Such membranes are conventionally prepared by collecting intestinal tissue 
from freshly sacrificed animals. The mucosa of the tissue is scraped, collected into 
buffered solutions, and homogenized. Cellular debris is removed and the membrane 

30 fractions are collected by centrifugation. Conventional techniques known to one of 
skill in the art maybe used for the preparation of brush border membrane vesicles. 
See Hauser, H., Howell, K., Dawson, R.M.C., Bowyer, D. E. Biochim. Biophys. Acta 
602. 567-577 (1980); Kramer, W., Girbig, F., Gutjahr, U., Kowalewski, S., Jouvenal, 
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K., Muller, G., Tripier, D., Wess, G. J. Biol. Chem. 268, 18035-18046 (1993); 
Rigtrup, K.M., Ong, D.E. Biochemistry 31, 2920-2926 (1992). 

The membrane preparation may be in vesicular or non-vesicular form. 
Alternatively, liposomes and liposomal preparations comprising 
NPC1L1 may also be a viable source of NPC1L1 for the screening assays of the 
present claimed method. 

In vitro cultured cells expressing NPC1L1 may also be used. The host 
cells may be prepared by transforming or transfecting a nucleic acid encoding an 
NPC1L1 of the invention into an appropriate host cell, whereby the receptor becomes 
incorporated into the membrane of the cell. A membrane fraction can then be isolated 
from the cell and used as a source of the receptor for assay. Alternatively, the whole 
cell expressing the receptor on the cell surface can be used in an assay. Preferably, 
specific binding of the labeled ligand or known antagonist or agonist to an 
untransfected/untransformed host cell or membrane fraction from an 
untransfected/untransformed host cell will be negligible. 

Preferred host cells include Chinese Hamster Ovary (CHO) cells, 
murine macrophage J774 cells, HEK-293 cells or any other macrophage cell line and 
human intestinal epithelial Caco2 cells. 

The present invention provides for a method of identifying a ligand of 
NPC1L1 using these membrane preparations, for example by contacting membranes 
comprising NPC1L1, such as brush border membrane vesicle preparations, with 
detectably labeled substituted azetidinone compounds which are known NPC1L1 
ligands, agonists or antagonists, and a candidate compound and determining whether 
the candidate compound can bind to NPClLi. The binding of the candidate 
compound to NPC1L1 may modulate binding of the detectably labeled NPC1L1 
ligands, agonists or antagonists to NPC1L1 . In addition, a NPC1 LI ligand may be 
identified by measuring the binding of NPC1L1 with detectably labeled NPC1L1 
ligands, agonists or antagonists in the presence and absence of the candidate 
compound wherein decreased binding of the detectably labeled NPC1L1 ligands, 
agonists or antagonists to NPC1L1 is an indication that the candidate compound is 
ligand of NPC1L1. 
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NPC1L1 may also be obtained by solubilization of membrane fractions 
comprising NPC1 LI. The membranes maybe obtained as discussed above, e.g., from 
mammalian tissue or in vitro cultured cells. 

Binding Affinities ofNPClLl Ligands. The affinity and specificity of 
5 the known ligand (e.g., detectably labeled substituted 2-azetidinone-glucuronide) for 
NPC1L1 are important to the identification of ligands that bind NPC1L1 in a 
screening assay. It is understood that the known ligand will be labeled for use in the 
screening assay. In an embodiment of the invention, the binding affinity of the known 
ligand for human NPC1L1 has a Kd value equivalent or lower than the Kd value of 
10 ezetimibe glucuronide 1 for human NPC1 LI . In an aspect of this embodiment, the 
binding affinity of the known ligand for human NPC1L1 has a Kd value of about 
200nM or lower; particularly it has a Kd value of about 1 OOnM or lower, more 
particularly it has a Kd value of about 50nM or lower; even more particularly it has a 
Kd value of about 20nM or lower; and most particularly it has a Kd value of about 
15 1 OnM or lower. For usefulness in the assay, there is essentially no lower limit on the 
Kd value of the known ligand and it may, for example, go down into the pM range. 
As the Kd value decreases, the binding affinity of the ligand for human NPC1 LI 
increases, which is desirable for the screening assay. 

In another embodiment of the invention, the binding affinity of the 
20 known ligand for rat NPC1L1 has a Kd value equivalent or lower than the Kd value of 
ezetimibe glucuronide 1 for rat NPC1 LI . In an aspect of this embodiment, the 
binding affinity of the known ligand for rat NPC1L1 has a K D value of about 200nM 
or lower; particularly it has a Kd value of about lOOnM or lower; more particularly it 
has a Kd value of about 50nM or lower, even more particularly it has a Kd value of 
25 about 2 OnM or lower; and most particularly it has a Kd value of about lOnM or lower. 

In another embodiment of this invention, the known ligand for human 
NPC1L1 is selected from (a) a sulfur-containing substituted 2-azetidinone- 
glucuronide that is labeled with 35 S, and particularly a compound of Formula II 
wherein R 9 contains an -SO2- group and (b) a substituted 2-azetidinone-glucuronide 
30 labeled with 125 L 

In one aspect of this embodiment, the known ligand for human 
NPC1L1 is selected from (a) a sulfur-containing substituted 2-azetidinone- 
glucuronide that is labeled with 35 S, and particularly a compound of Formula II 
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wherein R 9 contains an -S0 2 - group, and (b) a substituted 2-azetidinone-glucuronide 
labeled with 125 I, and has a K D value equivalent or lower than the K D value of 

ezetimibe glucuronide 1 . 

In another aspect of tins embodiment, the known ligand for human 
5 NPC1L1 is selected from (a) a sulfur-containing substituted 2-azetidinone- 
glucuronide that is labeled with 35 S, and particularly a compound of Formula II 
wherein R 9 contains an -SO*- group, and (b) a substituted 2-azetidinone-glucuronide 
labeled with l25 I, and has a K D value of about 200nM or lower, particularly it has a K D 
value of about lOOnM or lower; more particularly it has a K D value of about 50nM or 

10 lower, even more particularly it has a Kd value of about 20nM or lower; and most 
particularly it has a Kd value of about lOnM or lower. 

When using 3 H-labeled ezetimibe glucuronide in screening to identify 
NPC1L1 ligands from among the candidate compounds using mouse-derived 
membranes, candidate compounds identified as NPC1L1 ligands are preferably those 

1 5 candidates that exhibit a binding affinity having a Kd value of about 1 2,OO0nM or 
lower, preferably about 1 OOOnM or lower, more preferably about 1 OOnM or lower, 
and most preferably about lOnM or lower. When using 3 H-labeled ezetimibe 
glucuronide in screening to identify NPC1L1 ligands using rat-derived membranes or 
human-derived membranes, candidate compounds identified as NPC1L1 ligands are 

20 preferably those candidates that exhibit a binding affinity having a Kp value of about 
1 OOOnM or lower, preferably about lOOnM or lower, and more preferably about lOnM 
or lower. When using 3 H-Iabeled ezetimibe glucuronide in screening to identify 
NPC1L1 ligands using rhesus monkey-derived membranes, candidate compounds 
identified as NPC1L1 ligands are preferably those candidates that exhibit a binding 

25 affinity having a Kd value of about 50nM or lower, and preferably about lOnM or 
lower. 

When using 35 S-labeled compound 2 in screening to identify NPC1L1 
ligands from among the candidate compounds using rat or human-derived 
membranes, candidate compounds identified as NPC1L1 ligands are preferably those 
30 candidates that exhibit a binding affinity having a K D value in the range from about 
lOpM to about InM. When using 125 I-labeled substituted 2-azetidinone glucuronide 
compounds in the assay with rat or human membranes, candidate compounds 
identified as NPC1L1 ligands are preferably those candidates that exhibit a binding 
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affinity having a K D value in the range from about lOnM to about lOpM, and 
preferably from about lOOpM to about lOpM. 

Mouse Assay. The present invention comprises a mutant, transgenic 
mouse which lacks any functional NPC1L1 . This mouse may serve as a convenient 
control experiment in screening assays for identifying inhibitors of intestinal sterol 
(e.g., cholesterol) or 5a-stanol absorption, preferably inhibitors of NPC1L1. 
Preferably, a mouse assay of the present invention would comprise the following 
steps: 

(a) feeding a sterol (e.g., cholesterol) or 5a-stanol-containing 
substance (e.g., comprising radiolabeled cholesterol, such as 14 C-cholesterol or 3 H- 
cholesterol) to a first and second mouse comprising a functional NPC1L1 gene and to 
a third, mutant mouse lacking a functional NPC1L1 ; 

The sterol (e.g., cholesterol) or 5a-stanol containing substance 
preferably contains labeled cholesterol, such as a radiolabeled cholesterol, for 
example, 3 H or I4 C labeled cholesterol. The sterol (eg., cholesterol) or 5a-stanol 
containing substance may also include cold, unlabeled sterol (e.g., cholesterol) or 5a- 
stanol such as in corn oil. 

In these assays, the third npcllV mutant mouse serves as a (+)-control 
experiment which exhibits low levels of intestinal sterol (e.g., cholesterol) or 5a- 
stanol absorption and the second mouse serves as a (-)-control experiment which 
exhibits normal, uninhibited levels of intestinal sterol (e.g., cholesterol) or 5ct-stanol 
absorption. The second mouse is not administered the sample to be tested for an 
NPCIL1 antagonist. The first mouse is the experiment. 

(b) administering the sample to the first mouse comprising a 
functional NPC1L1 but not to the second mouse; 

(c) measuring the amount of sterol (e.g., cholesterol) or 5a-stanol 
absorption in the intestine of said first, second and third mouse; 

Intestinal sterol (e.g., cholesterol) or 5<x-stanol absorption may be 
measured by any method known in the art. For example, the level intestinal 
absorption can be assayed by measuring the level of serum sterol (e.g. 9 cholesterol) or 
5a-stanol. 

(d) comparing the levels of intestinal sterol (e.g. , cholesterol) or 
5a-stanol absorption in each mouse; 
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wherein the sample is determined to contain the intestinal sterol (e.g, 
cholesterol) or 5a-stanol absorption antagonist when the level of intestinal sterol (e.g., 
cholesterol) or 5a-stanol absorption in the first mouse and in the third mouse are less 
than the amount of intestinal sterol (e.g., cholesterol) or 5a-stanol absorption in the 
5 second mouse. 

Preferably, if the sample contains an intestinal sterol (e.g., cholesterol) 
or 5a-stanol absorption inhibitor (e.g., an NPC1L1 inhibitor), the level of sterol (e.g., 
cholesterol) or 5a-stanol absorption in the first mouse will be similar to that of the 
third, npclll mutant mouse. 
1 0 An alternative, (+)-control experiment which may be used in these 

screening assays is a mouse comprising functional NPC1L1 which is administered a 
known antagonist of NPC1L1, such as ezetimibe. 

Pharmaceutical Compositions 
NPC1L1 ligands discovered, for example, by the screening methods 
1 5 described above may be used therapeutically (e.g., in a pharmaceutical composition) 
to stimulate or block the activity of NPC1L1 and, thereby, to treat any medical 
condition caused or mediated by NPC1L1. In addition, the antibody molecules of the 
invention may also be used therapeutically (e.g., in a pharmaceutical composition) to 
bind NPC1L1 and, thereby, block the ability of NPC1L1 to bind a sterol (e.g. 9 
20 cholesterol) or 5a-stanol. Blocking the binding of a sterol (e.g., cholesterol) or 5a- 
stanol would prevent absorption of the molecule (e.g., by intestinal cells such as 
enterocytes). Blocking absorption of sterol (e.g., cholesterol) or 5a-stanol would be a 
useful way to lower serum sterol (e.g., cholesterol) or 5a-stanol levels in a subject 
and, thereby, reduce the incidence of, for example, hyperlipidemia, atherosclerosis, 
25 coronary heart disease, stroke or arteriosclerosis. 

The term "subject" or "patient" includes any organism, preferably 
animals, more preferably mammals (e.g., mice, rats, rabbits, dogs, horses, primates, 
cats) and most preferably humans. 

The term "pharmaceutical composition" refers to a composition 
30 including an active ingredient and a pharmaceutically acceptable carrier and/or 
adjuvant. 

Although the compositions of this invention could be administered in 
simple solution, they are more typically used in combination with other materials such 
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as carriers, preferably pharmaceutically acceptable carriers. Useful, pharmaceutically 
acceptable carriers can be any compatible, non-toxic substances suitable for 
delivering the compositions of the invention to a subject. Sterile water, alcohol, fats, 
waxes, and inert solids may be included in a pharmaceutically acceptable carrier. 
5 Pharmaceutically acceptable adjuvants (buffering agents, dispersing agents) may also 
be incorporated into the pharmaceutical composition. 

Preferably, the pharmaceutical compositions of the invention are in the 
form of a pill or capsule. Methods for formulating pills and capsules are very well 
known in the art. For example, for oral administration in the form of tablets or 

1 0 capsules, the active drug component may be combined with any oral, non-toxic 

pharmaceutically acceptable inert carrier, such as lactose, starch, sucrose, cellulose, 
magnesium stearate, dicalcium phosphate, calcium sulfate, talc, mannitol, ethyl 
alcohol (liquid forms) and the like. Moreover, when desired or needed, suitable 
binders, lubricants, disintegrating agents and coloring agents may also be incorporated 

15 in the mixture. Suitable binders include starch, gelatin, natural sugars, corn 
sweeteners, natural and synthetic gums such as acacia, sodium alginate, 
carboxymethylcellulose, polyethylene glycol and waxes. Among the lubricants there 
may be mentioned for use in these dosage forms, boric acid, sodium benzoate, sodium 
acetate, sodium chloride, and the like. Disintegrants include starch, methylcellulose, 

20 guar gum and the like. Sweetening and flavoring agents and preservatives may also 
be included where appropriate. 

The pharmaceutical compositions of the invention may be 
administered in conjunction with a second pharmaceutical composition or substance. 
In preferred embodiments, the second composition includes a cholesterol-lowering 

25 drug. When a combination therapy is used, both compositions may be formulated 
into a single composition for simultaneous delivery or formulated separately into two 
or more compositions (e.g., a kit). 

The formulations may conveniently be presented in unit dosage form 
and may be prepared by any methods well known in the art of pharmacy. See, e.g. 9 

30 Gilman et ah (eds.) (1990), The Pharmacological Bases of Therapeutics, 8th Ed., 
Pergamon Press; and Remington 's Pharmaceutical Sciences, supra, Easton, Penn.; 
Avis et aL (eds.) (1993) Pharmaceutical Dosage Forms: Parenteral Medications 
Dekker, New York; Lieberman et aL (eds.) (1990) Pharmaceutical Dosage Forms: 
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Tablets Dekker, New York; and Liebennan et aL (eds.) (1990), Pharmaceutical 

Dosage Forms: Disperse Systems Dekker, New York. 

The dosage regimen involved in a therapeutic application may be 

determined by a physician, considering various factors which may modify the action 
5 of the therapeutic substance, e.g. 9 the condition, body weight, sex and diet of the 

patient, the severity of any infection, time of administration, and other clinical factors. 

Often, treatment dosages are titrated upward from a low level to optimize safety and 

efficacy. Dosages may be adjusted to account for the smaller molecular sizes and 

possibly decreased half-lives (clearance times) following administration. 
10 An "effective amount" of a ligand of the invention may be an amount 

that will detectably reduce the level of intestinal sterol (e.g. 9 cholesterol) or 5a-stanol 

absorption or detectably reduce the level of serum sterol (eg., cholesterol) or 5a- 

stanol in a subject administered the composition. 

Typical protocols for the therapeutic administration of such substances 
1 5 are well known in the art. Pharmaceutical composition of the invention may be 

administered, for example, by any parenteral or non-parenteral route. 

Pills and capsules of the invention can be administered orally. 

Injectable compositions can be administered with medical devices known in the art; 

for example, by injection with a hypodermic needle. 
20 Injectable pharmaceutical compositions of the invention may also be 

administered with a needleless hypodermic injection device; such as the devices 

disclosed in U.S. Patent Nos. 5,399,163; 5,383,851; 5,312,335; 5,064,413; 4,941,880; 

4,790,824 or 4,596,556. 

Anti-Sense 

25 The present invention also encompasses anti-sense oligonucleotides 

capable of specifically hybridizing to mRNA encoding NPC1L1 (e.g., any of SEQ ID 
NOs: 1, 3, 5-1 1 or 13) having an amino acid sequence defined by, for example, SEQ 
ID NO: 2 or 4 or 12 or a subsequence thereof so as to prevent translation of the 
mRNA. Additionally, this invention contemplates anti-sense oligonucleotides capable 

30 of specifically hybridizing to the genomic DNA molecule encoding NPC1 LI , for 
example, having an amino acid sequence defined by SEQ ID NO: 2 or 4 or 12 or a 
subsequence thereof. 
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This invention further provides pharmaceutical compositions 
comprising (a) an amount of an oligonucleotide effective to reduce NPClLl-mediated 
sterol (e.g.) cholesterol) or 5a-stanol absorption by passing through a cell membrane 
and binding specifically with mKNA encoding NPC1L1 in the cell so as to prevent its 
5 translation and (b) a pharmaceutical^ acceptable carrier capable of passing through a 
cell membrane. In an embodiment, the oligonucleotide is coupled to a substance that 
inactivates mRNA. In another embodiment, the substance that inactivates mRNA is a 
ribozyme. 

Reducing the level of NPC1L1 expression by introducing anti-sense 
10 NPC1 LI RNA into the cells of a patient is a useful method reducing intestinal sterol 
(e.g. 9 cholesterol) or 5 □ -stand absorption and serum cholesterol in the patient 

Kits 

Kits of the present invention include ezetimibe, preferably combined 
with a pharmaceutical^ acceptable carrier, in a pharmaceutical formulation, more 

15 preferably in a pharmaceutical dosage form such as a pill, a powder, an injectable 
liquid, a tablet, dispersible granules, a capsule, a cachet or a suppository. See for 
example, Gilman et al. (eds.) (1990), The Pharmacological Bases of Therapeutics, 8th 
Ed., Pergamon Press; and Remington's Pharmaceutical Sciences, supra, Easton, 
Penn.; Avis et al. (eds.) (1993) Pharmaceutical Dosage Forms: Parenteral Medications 

20 Dekker, New York; Lieberman et al. (eds.) (1990) Pharmaceutical Dosage Forms: 
Tablets Dekker, New York; and lieberman et al. (eds.) (1990), Pharmaceutical 
Dosage Forms: Disperse Systems Dekker, New York. Preferably, the dosage form is 
a Zetia® tablet (Merck/Schering-Plough Corp.). Ezetimibe may be supplied in any 
convenient form. For example, tablets including ezetimibe may be supplied in bottles 

25 of 30, 90 or 500. 

The kits of the present invention also include information, for example 
in the form of a package insert, indicating that the target of ezetimibe is NPC1L1 
(NPC3). The term 'target of ezetimibe** indicates that ezetimibe reduces intestinal 
sterol (e.g, cholesterol) or 5oc-stanol absorption, either directly or indirectly, by 

30 antagonizing NPC1 LI . The form of the insert may take any form, such as paper or on 
electronic media such as a magnetically recorded medium (e.g., floppy disk) or a CD- 
ROM. 
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The package insert may also include other information concerning the 
pharmaceutical compositions and dosage forms in the kit. Generally, such 
information aids patients and physicians in using the enclosed pharmaceutical 
compositions and dosage forms effectively and safely. For example, the following 

5 information regarding ezetimibe (e.g. , Zetia®) and/or simvastatin (e.g., Zocor®) may 
be supplied in the insert: pharmacokinetics, pharmacodynamics, clinical studies, 
efficacy parameters, indications and usage, contraindications, warnings, precautions, 
adverse reactions, overdosage, proper dosage and administration, how supplied, 
proper storage conditions, references and patent information. 

10 The kits of the invention may also include simvastatin ( 




) preferably combined with a pharmaceutically acceptable carrier, in a pharmaceutical 
formulation, more preferably in a pharmaceutical dosage form such as a pill, a 
powder, an injectable liquid, a tablet, dispersible granules, a capsule, a cachet or a 

15 suppository. Preferably, the dosage form of simvastatin is a Zocor<E> tablet (Merck & 
Co.; Whitehouse Station, NJ). 

Tablets or pills comprising simvastatin may be supplied in any 
convenient form. For example, pills or tablets comprising 5mg simvastatin can be 
supplied as follows: bottles of 30, 60, 90, 100 or 1000. Pills or tablets comprising 10 

20 mg simvastatin maybe supplied as follows: bottles of 30, 60, 90, 100, 1000 or 10,000. 
Pills or tablets comprising 20 mg simvastatin may be supplied as follows: bottles of 
30, 60, 90, 100, 1000 or 10,000. Pills or tablets comprising 40 mg simvastatin maybe 
supplied as follows: bottles of 30, 60, 90, 100 or 1000. Pills or tablets comprising 80 
mg simvastatin maybe supplied as follows: bottles of 30, 60, 90, 100, 1000 or 10,000. 
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Ezetimibe and simvastatin may be supplied, in the kit, as separate 
compositions or combined into a single composition. For example, ezetimibe and 
simvastatin may be supplied within a single, common pharmaceutical dosage form 
(e.g., pill or tablet) as in separate pharmaceutical dosage forms (e.g., two separate 
5 pills or tablets). 

npclir Cells 

The present invention provides any isolated mammalian cell, (e.g, an 
isolated mouse cell, an isolated rat cell or an isolated human cell) which lacks an 
NPC1L1 gene which encodes or can produce a functional NPC1L1 protein. Included 
1 0 within this embodiment are mutant npclll genes comprising a point mutation, 

truncation or deletion of the genetic coding region or of any regulatory element {e.g., 
a promoter). 

For example, the cell can be isolated from a mutant mouse comprising 
a homozygous mutation of endogenous, chromosomal NPC1L1 wherein the mouse 

1 5 does not produce any functional NPC1 LI protein (e.g. , the mouse described below in 
Example 22). Moreover, the present invention comprises any cell, tissue, organ, fluid, 
nucleic acid, peptide or other biological substance derived or isolated from such a 
mutant mouse, particularly a mutant, transgenic mouse which does not produce any 
functional NPC1 LI, wherein the region of endogenous, chromosomal NPC1L1 

20 deleted, in the mouse, corresponds to nucleotides 790-998 of the nucleotide sequence 
set forth in SEQ ID NO: 45. 

The isolated cell can be isolated or derived, for example, from the 
duodenum, gall bladder, liver, small intestine or stomach of the mutant mouse. 
Further, the cell can be an enterocyte. 

25 The npclir mutant cells are useful, for example, for use in control 

experiments in screening assays (see e.g., supra) since they lack any NPC1L1- 
dependent uptake or binding of sterol, 5a-stanol or ezetimibe. The level of inhibition 
caused by a particular sample, in a screening assay, can be compared to that of an 
assay performed with the mutant cell. Ideally, though by no means necessarily, in a 

30 screening assay, for example, as described herein, the same amount of binding will be 
observed by a non-mutant cell or cell membrane, in the presence of an antagonist, as 
is observed in connection with a mutant npclll cell or cell membrane alone. 
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EXAMPLES 

The following examples are provided to more clearly describe the 
present invention and should not be construed to limit the scope of the invention in 
any way. 

5 Example 1: Cloning and Expression of Rat, Mouse and Human NPC1L1. 

Rat NPC1L1, mouse NPC1L1 or human NPC1L1 can all conveniently 
be amplified using polymerase chain reaction (PCR). In this approach, DNA from a 
rat, mouse or human cDNA library can be amplified using appropriate primers and 
standard PCR conditions. Design of primers and optimal amplification conditions 
10 constitute standard techniques which are commonly known in the art. 

An amplified NPC1L1 gene may conveniently be expressed, again, 
using methods which are commonly known in the art. For example, NPC1L1 may be 
inserted into a pETVbased plasmid vector (Stratagene; La Joola, CA), downstream of 
the T7 RNA polymerase promoter. The plasmid may then be transformed into a T7 

» 

15 expression system (e.g., BL21DE3 E.coli cells), grown in a liquid culture and induced 
(e g., by adding IPTG to the bacterial culture). 

Example 2: Direct Binding Assay. 

Membrane preparation : Caco2 cells transfected with an expression 
vector containing a polynucleotide encoding NPC1L1 (e.g. y SEQ ID NO: 2, 4 or 12) 

20 are harvested by incubating in 5 mM EDTA/phosphate-buffered saline followed by 
repeated pipeting. The cells are centrifuged 5 min at 1000 x g. The EDTA/PBS is 
decanted and an equal volume of ice-cold 50mM Tris-HCl, pH 7.5 is added and cells 
are broken up with a Polytron (PT10 tip, setting 5, 30 sec). Nuclei and unbroken cells 
are sedimented at 1O00 x g for 10 min and then the supernatant is centrifuged at 

25 50,000 x g for 1 0 min. The supernatant is decanted, the pellet is resuspended by 
Polytron, a sample is taken for protein assay (bicinchoninic acid, Pierce), and the 
tissue is again centrifuged at 50,000 x g. Pellets are stored frozen at -20°C. 

Binding assay : For saturation binding, four concentrations of f 3 !!]- 
ezetimibe (15 Ci/mmol) are incubated without and with 10" 5 M ezetimibe in triplicate 

30 with 50 ng of membrane protein in a total volume of 200 jil of 50 mM Tris-HCl, pH 
7.5, for 30 min at 30°C. Samples are filtered on GF/B filters and washed three times 
with 2 ml of cold Tris buffer. Filters are dried in a microwave oven, impregnated 

-55- 



WO 2005/069900 



PCT/US2005/001469 



with Meltilex wax scintillant, and counted at 45% efficiency. For competition 
binding assays, five concentrations of a sample are incubated in triplicate with 1 8 nM 
[^HJ-ezetimibe and 70 jj.g of membrane protein under the conditions described above. 
Curves are fit to the data with Prism (GraphPad Software) nonlinear least-squares 
5 curve-fitting program and Ki values are derived from IC50 values according to Cheng 
and Prusoff (Cheng, Y. C, et al, (1973) Biochem. Pharmacol. 22: 3099-3108). 

Example 3A: SPA Assay. 

For each well of a 96 well plate, a reaction mixture of 10 pg human, 
mouse or rat NPC1L1-CHO overexpressing membranes (Biosignal) and 200 fig/well 

10 YSi-WGA-SPA beads (Amersham) in 100 jj is prepared in NPC1L1 assay buffer 
(25 mM HEPES, pH 7.8, 2 mM CaCl 2 , ImM MgCl 2 , 125 mM NaCl, 0.1% BSA). A 
0.4 nM stock of Hgand- [ 125 I]-ezetimibe- is prepared in the NPC1L1 assay buffer. The 
above solutions are added to a 96-well assay plate as follows: 50 NPC1L1 assay 
buffer, 100 jol of reaction mixture, 50 pi of Hgand stock (final ligand concentration is 

15 0. 1 nM). The assay plates are shaken for 5 minutes on a plate shaker, then incubated 
for 8 hours before cpm/well are determined in Microbeta Trilux counter 
(PerkinElmer). 

These assays will indicate that [ 125 I]-ezetimibe binds to the cell 
membranes expressing human, mouse or rat NPC 1 LI . Similar results will be 
20 obtained if the same experiment is performed with radiolabeled cholesterol (e.g. 9 
l25 I-cholesterol). 

Example 3A: Alternate SPA Assay. 

The final concentrations should be: 1 nM 3s S-2 (Km ~ 2-5 nM, 
~50,000 dpm/assay); 

25 1 p.g membranes (-1-2 nM receptor); 0.007%-0.03% taurocholate 

(0.140 pi 1% stock); 0.01 0%-0.05% digitonin (0.200 pi 1% stock); 5% DMSO (1.00 
pi inhibitors). 

In each well of a 96 well plate is put. 1 pi DMSO inhibitor solution, and 
£hen the radioligand and detergents are added as a 2X solution in 10 pi buffer A. 
30 Shake for a minute to be sure the inhibitor and ligand are mixed, before initiating with 
9 pi diluted receptor solution in buffer A. After shaking again, the plate is incubated 
at 37°C for 2 hours. Then WGA beads (0.3 mg) are added as a 3 pi suspension in 
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buffer A, then shake for 30 minutes. Similar results are obtained if membranes are 
pre-incubated with beads for 30 minutes before adding ligands. Finally, dilute to 
300 |xl with buffer A, cover the plate, spin at 3,000 rpmx5 min, and read at 2 minutes 
per well in the "Microbeta" counter. 

5 Stocks 

Ligand : 35 S-2 is 525.42 nM, 0616 jiCi/^il, in acetonitrile; spec, act 
- 3.8916 X 10- 4 finol/dpm; 1 168 Ci/mmol) 

Membranes : 3 rd batch recombinant human expressed in 
HEK-293 cells; 20.2 ug/\d stock; -20-40 pmol NPClLl/mg protein 
1 0 Buffer A : 26 mM NaHC0 3 ; 0.96 mN NaH 2 P0 4 ; 5 mM HEPES; 

optional addition of 5.5 mM glucose; 1 17 mM NaCl; 5.4 mM KC1 

Example 4: Cholesterol Uptake Assay. 

CHO cells expressing either SR-B1 or three different clones of rat 
NPC1L1 or one clone of mouse NPC1 LI were starved overnight in cholesterol free 

1 5 media then dosed with [3H]-cholesterol in a mixed synthetic micelle emulsion for 4 
min, 8 min, 12 rani or 24 min in the absence or presence of 10 OM ezetimibe. The 
cells were harvested and the lipids were organically extracted. The extracted lipids 
were spotted on thin-layer chromatography (TLC) plates and resolved within an 
organic vapor phase. The free cholesterol bands for each assay were isolated and 

20 counted in a scintillation counter. 

The SR-B1 expressing cells exhibited an increase in [ 3 H]-cholesterol 
uptake as early as 4 min which was also inhibited by ezetimibe. The three rat clones 
and the one mouse clone appeared to give background levels of [ 3 H]-cholesterol 
uptake which was similar to that of the untransformed CHO cell. 

25 These experiments will yield data demonstrating that CHO cells can 

perform mouse, rat and human NPC 1 L 1 -dependent uptake of [ 3 H]-cholesterol when 
more optimal experimental conditions are developed. 

Example 5: Expression of Rat NPC1X1 in Wistar Rat Tissue. 

In these experiments, the expression of rat NPC1L1 mRNA, in several 
30 rat tissues, was evaluated. The tissues evaluated were esophagus, stomach, 

duodenum, jejunum, ileum, proximal colon, distal colon, liver, pancreas, heart, 
aorta, spleen, lung, kidney, brain, muscle, testes, ovary, uterus, adrenal gland and 
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thyroid gland Total RNA samples were isolated from at least 3 male and 3 female 
animals and pooled. The samples were then subjected to real time quantitative 
PCR using Taqman analysis using standard dual-labeled fluorogenic 
oligonucleotide probes. Typical probe design incorporated a 5* reporter dye 
5 (e.g., 6FAM (6-carboxyfluorescein) or VIC) and a 3' quenching dye (e.g. 9 TAMRA 
(6-carboxytetramethyl-rhodamine)). 
rat NPC1L1 : 

Forward: TCTTCACCCITGCTCTTTGC (SEQ ID NO: 14) 
Reverse: AATGATGGAGAGTAGGTTGAGGAT (SEQ ID NO: 15) 

10 Probe: [6F AM]TGCCCACCTTTGTTGTCTGCTACC [TAMRA] 

(SEQ ID NO: 16) rat ft-actin : 

Forward: ATCGCTGACAGGATGCAGAAG (SEQ ID NO: 17) 
Reverse: TCAGGAGGAGCAATGATCTTGA (SEQ ID NO: 18) 
Probe: [yiC]AGATTACTGCCCTGGCTCCTAGCACCAT[TAMRA] 

15 (SEQ ID NO: 19) 

PCR reactions were run in 96-well format with 25 )d reaction mixture 
in each well containing: Platinum SuperMix (12.5 ^1), ROX Reference Dye (0.5 ^1), 
50 mM magnesium chloride (2 ^il), cDNA from RT reaction (0.2 ^1). Multiplex 
reactions contained gene specific primers at 200 nM each and FAM labeled probe at 

20 100 nM and gene specific primers at 100 nM each and VIC labeled probe at 50 nM. 
Reactions were run with a standard 2-step cycling program, 95°C for 15 sec and 60°C 
for 1 min, for 40 cycles. 

The highest levels of expression were observed in the duodenum, 
jejunum and ileum tissue. These data indicate that NPC1L1 plays a role in cholesterol 

25 absorption in the intestine. 

Example 6: Expression of Mouse NPC1L1 in Mouse Tissue. 

In these experiments, the expression of mouse NPC1L1 mRNA, in 
several tissues, was evaluated. The tissues evaluated were adrenal gland, BM, brain, 
heart, islets of langerhans, LI, small intestine, kidney, liver, lung, MLN, PLN, muscle, 
30 ovary, pituitary gland, placenta, Peyers Patch, skin, spleen, stomach, testes, thymus, 
thyroid gland, uterus and trachea. Total RNA samples were isolate from at least 3 
male and 3 female animals and pooled. The samples were then subjected to real time 
quantitative PCR using Taqman analysis using the following primers and probes: 
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mouse NPC1L1 : 

Forward: ATCCTCATCCTGGGCTTTGC (SEQ ID NO: 20) 

Reverse: GCAAGGTGATCAGGAGGTTGA (SEQ ID NO: 21) 

Probe: [6FAM]CCCAGCTTATCCA. 
5 (SEQ ID NO: 22) 

The highest levels of expression were observed in the Peyer*s Patch, 
small intestine, gall bladder and stomach tissue. These data are consistent with a 
cholesterol absorption role for NPC1 LI which takes place in the digestive system. 

Example 7: Expression of Human NPC1L1 in Human Tissue. 

10 In these experiments, the expression level of human NPC1L1 mRNA 

was evaluated in 2045 samples representing 46 normal tissues. Micro array-based 
gene expression analysis was performed on the AfFymetrix HG-U95 GeneChip using 
a cRNA probe corresponding to base pairs 4192-51 17 (SEQ ID NO: 43) in strict 
accordance to Affymetrix's established protocols. Gene Chips were scanned under 

1 5 low photo multiplier tube (PMT), and data were normalized using either Asymetrix 
MAS 4.0 or MAS 5.0 algorithms. In addition "spike ins" for most samples were used 
to construct a standard curve and obtain RNA concentration values according Gene 
Logic algorithms and procedures. A summary of these results are indicated, below, in 
Table 2. 
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Table 2. Expression level of NPC1L1 mRNA in various human tissues. 
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Shaded data corresponds to tissues wherein the highest levels of 
NPC1L1 mRNA was detected. The "Present" column indicates the proportion of 
specified tissue samples evaluated wherein NPC1L1 mRNA was detected. The 
5 "Absent" column indicates the proportion of specified tissue samples evaluated 
wherein NPC1L1 RNA was not detected. The "lower 25%", "median" and "upper 
75%" columns indicate statistical distribution of the relative NPC1L1 signal 
intensities observed for each set of tissue evaluated. 

Example 8: Distribution of Rat NPC1L1, Rat EBAT or 
1 0 Rat SR-B1 mRNA In Rat Small Intestine. 

In these experiments, the distribution of rat NPC1L1 mRNA along the 
proximal-distal axis of rat small intestines was evaluated. Intestines were isolated 
from five independent animals and divided into 10 sections of approximately equal 
length. Total RNA was isolated and analyzed, by real time quantitative PCR using 
15 Taqman analysis, for localized expression levels of rat NPC1L1, rat IBAT (ileal bile 
acid transporter) or rat SR-B1 mRNA. The primers and probes used in the analysis 
were: 
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rat NPC1L1 : 

Forward: TCTTCACCCITGCTCTTTGC (SEQ ID NO: 23) 
Reverse: AATGATGGAGAGTAGGTTGAGGAT (SEQ ID NO: 24) 
Probe: [6FAM]TGCCCACCTTTGTTGTCTGCTACC[TAMRA] 
5 (SEQ ID NO: 25) rat Villin : 

Forward: AGCACCTGTCCACTGAAGATTTC (SEQ ID NO: 26) 

Reverse: TGGACGCTGAGCTTCAGTTCT (SEQ ID NO: 27) 

Probe: [VIC]CTTCTCTGCGCTGCCTCGATGGAA[TAMRA] (SEQ 
ID NO: 28) ratffl?-Bl : 

10 Forward: AGTAAAAAGGGCTCGCAGGAT (SEQ ID NO: 29) 

Reverse: GGCAGCTGGTGACATCAGAGA (SEQ ID NO: 30) 
Probe: [6FAM]AGGAGGCCATGCAGGCCTACTCTGA[TAMRA] 
(SEQIDNO:31)raLS4£ 

Forward: GAGTCCACGGTCAGTCCATGT (SEQ ID NO: 32) 

15 Reverse: TTATGAACAACAATGCCAAGCAA (SEQ ID NO: 33) 

Probe: [6FAMJAGTCCITAGGTAGTGGCTTAGTCCCTGGAAGC 
TC[TAMRA] (SEQ ID NO: 34) 

The mRNA expression levels of each animal intestinal section were 
analyzed separately, then the observed expression level was normalized to the 
20 observed level of villin mRNA in that intestinal section. The observed, normalized 
mRNA expression levels for each section where then averaged. 

The expression level of NPC1L1 and SR-B1 were highest in the 
jejunum (sections 2-5) as compared to that of the more distal ileum sections. Since 
the jejunum is believed to be the site of cholesterol absorption, these data suggest 
25 such a role for rat NPC ILL IB AT distribution favoring the ileum is well document 
and served as a control for the experiment. 

Example 9: In situ Analysis of Rat NPC1L1 mRNA in Rat Jejunum Tissue. 

The localization of rat NPC1L1 mRNA was characterized by in situ 
hybridization analysis of rat jejunum serial sections. The probes used in this analysis 
30 were: 

T7-sense probe : GTAATACGACTCACTATAGGGCCCTGACGGT 
CCTTCCTGAGGGAATCTTCAC (SEQ ID NO: 35) 
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T7-antisensR p robe: GTAATACGACTCACTATAGGGCCTGGGAA 
GTTGGTCATGGCCACTCCAGC (SEQ ID NO: 36) 

The RNA probes were synthesized using T7 RNA polymerase 
amplification of a PCR amplified DNA fragment corresponding rat NPC1L1 
5 nucleotides 33 1 8 to 3672 (SEQ ID NO 1). Sense and anti-sense digoxigenirt-UTP 
labeled cRNA probes were generated from the T7 promoter using the DIG RNA 
Labeling Kit following the manufacturer's instructions. Serial cryosections rat 
jejunum were hybridized with the sense and antiisense probes. Digoxigenin labeling 
was detected with the DIG Nucleic Acid Detection Kit based on previous methods. A 
1 0 positive signal is characterized by the deposition of a red reaction product at the site 
of hybridization. 

The anti-sense probe showed strong staining of epithelium along the 
crypt-villus axis under low magnification (40X). The observed rat NJPC1L1 mRNA 
expression levels may have been somewhat greater in the crypts than in the villus tips. 
1 5 Under high magnification (200X), staining was observed in the enterocytes but not in 
the goblet cells. A lade of staining observed with the sense probe (control) confirmed 
the high specificity of the NPC1L1 anti-sense signal. These data provided further 
evidence of the role of rat NPC1L1 in intestinal cholesterol absorption. 

Example 10: FACS Analysis of FluorescentiLy Labeled 
20 Ezetimibe Binding to Transiently Transfected CHO Cells. 

In these experiments, the ability of BODIPY-labeled ezetimibe 

(Altmann, et al, (2002) Biochim. Biophys. Acta 1580(1): 77-93) to bind to NPC1L1 

and SR-B1 was evaluated. "BODIPY" is a fluorescent group which was used to 

detect the BODIPY-ezetimibe. Chinese hamster ovary (CHO) cells were transiently 

25 transfected with rat NPC1L1 DNA (rNPCl Ll/CHO), mouse NPC1LI DNA 

(mNPClLl/CHO), mouse SR-B1 DNA (mSRBI/CHO) or EGFP DNA (EGFP/CHO). 
EGFP is enhanced green fluorescent protein which was used as a positive control. 
The transfected CHO cells or untransfected CHO cells were then stained with 100 nM 
BODIPY-labeled ezetimibe and analyzed by FACS. Control experiments were also 

30 performed wherein the cells were not labeled with the BODIPY-ezetiniibe and 
wherein untransfected CHO cells were labeled with the BODIPY-ezetimibe. 

No staining was observed in the untransfected CHO, rNPClLl/CHO 
or mNPClLl/CHO cells. Fluorescence was detected in the positive-control 
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EGFP/CHO cells. Staining was also detected in the mouse SR-B1/CHO cells. These 
data show that, under the conditions tested, BODIPY-ezetimibe is capable of binding 
to SR-B1 and that such binding is not ablated by the presence of the fluorescent 
BODIPY group. When more optimal conditions are determined, BODIPY-ezetimibe 
will be shown to label the rNPClLl/CHO and mNPClLl/CHO cells. 

Example 11: FACS Analysis of Transiently Transfected 





r^ 1 
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In these experiments, the expression of FLAG- tagged NPC1L1 on 
CHO cells was evaluated. CHO cells were transiently transfected with mouse 

1 0 NPC1L1 DNA, rat NPC1L1 DNA, FLAG- rat NPC1L1 DNA or FLAG- mouse 

NPC1L1 DNA The 8 amino acid FLAG tag used was DYKDDDDK (SEQ ID NO: 
37) which was inserted on the ammo-terminal extracellular loop just past the secretion 
signal sequence. The cells were incubated with commercially available anti-FLAG 
monoclonal mouse antibody M2 followed by a BODIPY-tagged anti-mouse 

1 5 secondary antibody. The treated cells were then analyzed by FACS. 

The M2 antibody stained the CHO cells transfected with FLAG-rat 
NPC1L1 DNA and with FLAG-mouse NPC1L1 . No staining was observed in the 
CHO cells transfected with mouse NPC1L1 DNA and with rat NPC1L1 DNA. These 
data showed that rat NPC1L1 and mouse NPC1L1 possess no significant, inherent 

20 fluorescence and are not bound by the anti-FLAG antibody. The observed, FLAG- 
dependent labeling of the cells indicated that the FLAG-mouse NPC1L1 and FLAG- 
rat NPC1L1 proteins are localized at the cell membrane of the CHO cells. 

Example 12: FACS Analysis of FLAG-rat NPC1L1-EGFP Chimera 
in Transiently Transfected CHO Cells. 

25 In these experiments, the surface and cytoplasmic localization of rat 

NPC1L1 in CHO cells was evaluated. CHO cells were transiently transfected with 
FLAG- rat NPC1L1 DNA or with FLAG-rat NPC1L1-EGFP DNA In these fusions, 
the FLAG tag is at ammo-terminus of rat NPC1L1 and EGFP fusion is at the carboxy- 
terminus of rat NPC1L1. The cells were then stained with the M2 anti-FLAG mouse 

30 (primary) antibody followed by secondary staining with a BODIPY-labeled anti- 
mouse antibody. In control experiments, cells were stained with only the secondary 
antibody and not with the primary antibody (M2). The stained cells were then 
analyzed by FACS. 

■ 
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In a control experiment, FL^VG-rat NPC1L1 transfected cells were 
stained witb BODIPY anti-mouse secondary antibody but not with the primary 
antibody. The data demonstrated that the secondary, anti-mouse antibody possessed, 
no significant specificity for FLAG-rat NPC1L1 and that the FLAG-rat NPC1L1, 
5 itself, possesses no significant fluorescence?. 

In another control experiment, unlabeled FLAG-rat NPC1L1-EGFP 
cells were FACS analyzed. In these experiments, auto fluorescence of the enhanced 
green fluorescent protein (EGFP) was detected. 

FLAG-rat NPC1 LI cells wexe stained with anti T FLAG mouse antibody 
1 0 M2 and with the BODIPY-labeled anti-mouse secondary antibody and FACS 

analyzed. The data from this analysis showed that the cells were labeled with the 
secondary, BODIPY-labeled antibody whi&h indicated expression of the FLAG-rat 
NPC1L1 protein on the surface of the CHO cells. 

FLAG-rat NPC1 LI -EGFP cells were stained with anti-FLAG mouse 
1 5 antibody M2 and with the BODIPY-labeled anti-mouse secondary antibody and 

FACS analyzed. The data from this analysis showed that both markers (BODIPY and 
EGFP) were present indicating surface expression of the chimeric protein. The data 
also indicated that a portion of the protein w^as located within the cells and may be 
associated with transport vesicles. These data supported a role for rat NPC1L1 in 
20 vesicular transport of cholesterol or protein expressed in subcellular organelles such 
as the rough endoplasmic reticulum. 

Example 13: FACS Analysis and Fluorescent Microscopy of ' 
FLAG-rat NPC1L1-EGFP Chimera in a Cloned CHO Cell Line. 

In these experiments, the cellular localization of rat NPC1L1 was 

25 evaluated by FACS analysis and by immunolustochemistry. CHO cells were 

transfected with FLAG-rat NPC1L1-EGFP DNA and stained with anti-FLAG mouse 

antibody M2 and then with a BODIPY-labeled anti-mouse secondary antibody, hi the 

fusion, the FLAG tag is at the amino-terminias of rat NPC1L1 and the enhanced green 

fluorescent protein (EGFP) tag is located at tiie carboxy-tenninus of the rat NPC1L1. 

30 The stained cells were then analyzed by FACS and by fluorescence microscopy. 

Cells transfected with FLAG— xat NPC1L1-EGFP DNA were stained 

with the anti-FLAG mouse antibody M2 and then with the BODIPY-labeled anti- 
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mouse secondary antibody. FACS analysis of the cells detected both markers 
indicating surface expression of the chimeric protein. 

FLAG-rat NPC1L1-EGFP transfected cells were analyzed by 
fluorescent microscopy at 63X magnification. Fluorescent microscopic analysis of 
the cells indicated non-nuclear staining with significant perinuclear organelle staining. 
Resolution of the image could not confirm the presence of vesicular associated 
protein. These data indicated that the fusion protein was expressed on the cell 
membrane of CHO cells. 

Example 14: Generation of Polyclonal Anti-rat NPCIL1 Rabbit Antibodies. 

Synthetic peptides (SEQ ID NO: 39-42) containing an amino- or 
caiboxy^erminal cysteine residue were coupled to keyhole limpet hemocyanin (KLH) 
carrier protein through a disulfide linkage and used as antigen to raise polyclonal 
antiserum in New Zealand white rabbits (range 3-9 months in age). The KLH-peptide 
was emulsified by mixing with an equal volume of Freund's Adjuvant, and injected 
into three subcutaneous dorsal sites. Prior to the 16 week imm unization schedule a 
pre-immune sera sample was collected which was followed by a primary injection of 
0.25 mg KLH-peptide and 3 scheduled booster injections of 0.1 mg KLH-peptide. 
Animals were bled from the auricular artery and the blood was allowed to clot and the 
serum was then collected by centrifugation 
20 The anti-pep tide antibody titer was determined with an enzyme linked 

immunosorbent assay (ELISA) with free peptide bound in solid phase (ljag^well). 
Results are expressed as the reciprocal of the serum dilution that resulted in an OD 450 
of 0.2. Detection was obtained using the biotinylated anti-rabbit IgG, horse radish 
peroxidas^-streptavidin (HRP-SA) conjugate, and ABTS. 

25 Example 15: FACS Analysis of Rat NPC1L.1 Expression in 
CHO Cells Transiently Transfected with Rat NPC1L1 DNA 
Using Rabbit Anti-rat NPC1L1 Antisera. 

In these experiments, the expression of rat NPC1 LI on the surface of 

CHO cells was evaluated. CHO cells were transfected with rat NPC1L1 DNA, then 

30 incubated with either rabbit preimmune serum or with 1 0 week anti-rat NPC1 LI 

serum described, above, in Example 14 (i.e., A0715, A0716, A0867 or A0868). Cells 

labeled with primary antisera were then stained with a BODIPY-modified anti-rabbit 

secondary antibody followed by FACS analysis. 
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No antibody surface labeling was observed for any of the pre-immune 
sera samples. Specific cell surface labeling of rat NPC1L1 transfected cells was 
observed for both A0715 and A0868. Antisera A071 6 and A0867 did not recognize 
rat NPC1L1 surface expression in this assay format. This indicates that the native, 
5 unfused rat NPC1L1 protein is expressed in the CHO cells and localized to the CHO 
cell membranes. Cell surface expression of NPC1L1 is consistent with a role in 
intestinal cholesterol absorption. 

Example 16: FACS Analysis of CHO Cells Transiently Transfected 
with FLAG-Mouse NPC1L1 DNA or FLAG-rat NPC1L1 DNA or 
10 Untransfected CHO Cells Using Rabbit Anti-rat NPC1L1 Antisera. 

In these experiments, the expression of FLAG-mouse NPC1L1 and 
FLAG-rat NPC1L1 in CHO cells was evaluated. CHO cells were transiently 
transfected with FLAG-mouse NPC1L1 DNA or with FLAG-rat NPC1L1 DNA. The 
FLAG-mouse NPC1L1 and FLAG-rat NPC1L1 transfected cells were labeled with 

15 either A0801,A0802,A071 5 or A0868 sera (see Example 14) or with anti-FLAG 
antibody, M2. The labeled cells were then stained with BODEPY-labeled anti-rabbit 
secondary antibody and FACS analyzed. The untransfected CHO cells were analyzed 
in the same manner as the transfected cell lines. 

Positive staining of the untransfected CHO cells was not observed for 

20 any of the antisera tested. Serum A0801 -dependent labeling of FLAG-rat NPC1L1 
transfected cells was observed but such labeling of FLAG-mouse NPC1L1 transfected 
cells was not observed. Serum A0802-dependent labeling of FLAG-mouse NPC1L1 
or FLAG-rat NPC1L1 transfected cells was not observed. Strong serum A0715- 
dependent labeling of FLAG-rat NPC1L1 transfected cells was observed and weak 

25 serum A0715-dependent labeling of FLAG-mouse NPC1L1 transfected cells was 
observed. Weak serum A0868-deperident labeling of rat NPC1L1 and mouse 
NPC1L1 transfected cells was observed Strong Anti-FLAG M2 antibody-dependent 
labeling of FLAG-rat NPC1 LI and FLAG-mouse NPC1 LI transfected cells was 
observed. The strong M2 staining is likely to be due to the fact that M2 is an affinity- 

30 purified, monoclonal antibody of known concentration. In contrast, the respective 
antisera are polyclonal, unpurified and contain an uncertain concentration of anti-rat 
NPC1L1 antibody. These date provide further evidence that the FLAG-mouse 
NPC1L1 and FLAG-rat NPC1L1 proteins are expressed in CHO cells and localized to 
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the CHO cell membranes. Cell surface expression of NPC1L1 is consistent with a 
role in intestinal cholesterol absorption. 

Example 17: Immunohistochemical Analysis of Rat Jejunum Tissue 
with Rabbit Anti-rat NPC1L1 Antisera A0715. 

5 In these experiments, the localization ofratNPClLl in rat jejunum 

was analyzed by immunohistochemistry. Rat jejunum was removed, immediately 
embedded in O.C.T. compound and frozen in liquid nitrogen. Sections (6pm) were 
cut with a cryostat microtome and mounted on glass slides. Sections were air dried at 
room temperature and then fixed in Bouin's fixative. Streptavidin-biotin-peroxidase 

10 immunostaining was carried out using Histostain-SP kit. Endogenous tissue 

peroxidase activity was blocked with a 10 minute incubation in 3% H2O2 in methanol, 
and nonspecific antibody binding was minimized by a 45 minute incubation in 10% 
nonimmune rabbit serum. Sections were incubated with a rabbit anti-rat NPC1L1 
antisera A071 5 or A0868 at a 1 : 500 dilution at 4°C, followed by incubation with 

1 5 biotinylated goat anti-rabbit IgG and with streptavidin-peroxidase. Subsequently, the 
sections wore developed in an aminoethyl carbazole (AEQ-KfeQi staining system and 
counterstained with hematoxylin and examined by microscopy. A positive reaction 
using this protocol is characterized by the deposition of a red reaction product at the 
site of the antigen-antibody reaction. Nuclei appeared blue from the hematoxylin 

20 counterstain. Controls were performed simultaneously on the neighboring sections 
from the same tissue block. Control procedures consisted of the following: (1) 
substitute the primary antibody with the pre-immune serum, (2) substitute the primary 
antibody with the non-immune rabbit serum, (3) substitute the primary antibody with 
PBS, (4) substitute the second antibody with PBS. 

25 The example shows tissue stained with anti-rat NPC 1 LI sera A07 1 5 or 

with the preimmune sera analyzed at low magnification (40X) and at high 
magnification (200X). The A0715-stained tissue, at low magnification, showed 
positive, strong staining of the villi epithelial layer (enterocytes). The A0715-stained 
tissue at high magnification showed positive, strong staining of the enterocyte apical 

30 membranes. No staining was observed in tissue treated only with preimmune sera. 
Similar results were obtained with sera A0868. These data indicate that rat NPC1 LI 
is expressed in rat jejunum which is consistent with a role in intestinal cholesterol 
absorption. 
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Example 18: Labeled Cholesterol Uptake Assay. 

In this example, the ability of CHO cells stably transfected with rat 
NPC1L1 to take up labeled cholesterol was evaluated. In these assays, cholesterol 
uptake, at a single concentration, was evaluated in a pulse-chase experiment. The 
5 data generated in these experiments are set forth, below, in Table 3 . 

Cells: 

A. CHO cells stably transfected with rat NPC1L1 cDNA 

B. CHO background (no transfection) 

Cells were seeded at 500,000 cells/ well (mL) in 12-well plates. 

10 Procedure: 

All reagents and culture plates were maintained at 37°C unless 
otherwise noted. 

Starve. The maintenance media (F 12 HAMS, l%Pen/Strep, 10%FCS) 
was removed and the cells were rinsed with serum-free HAMS media. The serum- 
15 free media was then replaced with 1 mL "starve" media (F12 HAMS, Pen/Strep, 5% 
lipoprotein deficient serum (LPDS). 

One plate of each cell line was starved overnight. The remaining 2 
plates were designated "No Starve" (see below). 

Pre- Incubation. Media was removed from all plates, rinsed with 
20 serum-free HAMS and replaced with starve media for 30 minutes. 

3 H-Cholesterol Pulse. The following was added directly to each well. 
0.5nCi ^-cholesterol (-1.1 X 10 6 dpm/well) in 50^x1 of a 
mixed bile salt micelle. 

4.8 mM sodium taurocholate (2.581 mg/niL) 
25 0.6 mM sodium oleate (0.183 mg/mL) 

0.25 mM cholesterol (0. 1 mg/mL) 
Dispersed in "starve" media by ultrasonic vibration 
Final media cholesterol concentration = 5jig/mL 
Labeled cholesterol pulse time points were 0, 4, 12 and 24 minutes. 
30 Triplicate wells for each treatment were prepared. 

Wash- At the designated times, media was aspirated and the cells 
were washed once with Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% BSA, pH 7.4) 
and once with Hobbs Buffer B (50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37°C. 
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Processing/Analysis. Cells were digested overnight with 0.2N NaOIH, 
2mL/well at room temperature. One 1 .5 mL aliquot was removed from each well, 
neutralized & counted for radioactivity by scintillation counting. Two additional SOjil 
aliquots from all wells are assayed for total protein by the Pierce micro BCA metho<I. 
The quantity of labeled cholesterol observed in the cells was normalized by the 
quantity of protein in the cells. 

Table 3. Uptake of 3H-cholesterol by CHO cells transfected with rat NPCJL1 or 
mouse SR-B1 or untransfected CHO cells. 



AftafeCboiestcral 



Ttatal Cholesterol* d 


pmprotelB±sesi 


NPC1L1 | 


CHO 



24 



12 



24 



Total Cholesterol, dmn/mg pnrt£hr±sem 



NFOL1 



CHO 



2067 ±46 



2619 £130 



2868 ±193 



7010 ±89 



No Starve 



4568 ±1937 



2868 ±193 



4459 ±170 



7204 ±173 



10754 ±166 



15366 ±938 



15636 ±1471 



41129 ±685 



22881 ±9230- 



15636 ±1471 



24622 ±966 



39361 ±1207" 



Starve 



1937 ±273 



3023 ±308 



2759 ±105 



6966 ±72 



2440 ±299 



2759 ±105 



4857 ±186 



7344 ±65 



10909 ±1847 



17278 ±1650 



14307 ±781 



39196 ±174 



12429 ±167$ 



14307 ±781 



26270 ±1473 



38381 ±161 



dpm=disintegrations per minute 
sem=3tandard error of the 
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Example 19: Effect of Ezetimibe on Cholesterol Uptake. 

The effect of ezetimibe on the ability of CHO cells stably transfected 
with mouse or rat NPC1L1 or mouse SR-B1 to take up 3 H-labeled cholesterol was 
evaluated in pulse-chase experiments. One cDNA clone of mouse NPC1L1 (C7) and 
three clones of rat NPC1L1 (C7, C17 and C21) were evaluated. The ability of CHO 
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cells stably transfected with mouse SR-B1, mouse NPC1L1 and rat NPC1L1 to take up 
labeled cholesterol, in the absence of ezetimibe, was also evaluated in the pulse-chase 
experiments. Data generated in these experiments are set forth, below, in Tables 4 
and 5. Additionally, the quantity of total cholesterol taken up by transfected and 
5 untransfected CHO cells in the presence of four different unlabeled cholesterol 
concentrations was also evaluated. The data from these experiments is set forth, 
below, in Table 6. 

Cells: 

A. CHO cells stably transfected with rat or mouse NPC1L1 cDNA 
10 B . CHO background (no transfection) 

C. SR-B1 transfected CHO cells 
Cells seeded at 500,000 cells / well (mL) in 12-well plates. 

Procedure: 

All reagents and culture plates were maintained at 37°C unless 
1 5 otherwise noted. 

Starve. The maintenance media (F12 HAMS, l%Pen/Strep, 10%FCS) 
was removed and the cells were rinsed with serum-free HAMS media. The serum- 
free media was then replaced with 1 mL "starve" media (F12 HAMS, Pen/Strep, 5% 
lipoprotein deficient serum (LPDS). The cells were then starved overnight 
20 Pre-Incubation/ pre-dose. Media was removed from all plates and 

replaced with fresh starve media and preincubated for 30 minutes. Half of the wells 
received media containing ezetimibe (stock soln in EtOH; final cone. = IOjiM). 

3 H-Cholesterol Pulse. The following was added directly to each well: 
O.SnCi 3 H-cholesterol (-1.1 X 106 dpm/well) in 50^1 of a 
25 mixed bile salt micelle 

4.8mM sodium taurocholate (2.581mg/mL) 
0.6 mM sodium oleate (0.183 mg/mL) 
0.25 mM cholesterol (0. 1 mg/mL) 
Dispersed in "starve" media by ultrasonic vibration 
30 Final media cholesterol concentration = 5^ig/mL 

Labeled cholesterol pulse time points were 4, 12, 24 minutes and 4 
hours. Triplicate wells were prepared for each treatment. 
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Wash. At designated times, media was aspirated and cells were 
washed once with Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% bovine serum 
albumin (BSA), pH 7.4) and once with Hobbs Buffer B (50mM Tris, 0.9% NaCl, 
pH 7.4 (no BSA)) at 37°C. 
5 Processing/Analysis. 

A. 4. 12. 24 m inute tim e points : Cells were digested overnight 
with 0.2N NaOH, 2mL/well, room temperature. One 1.5 mL aliquot was removed 
from each well, neutralized & counted for radioactivity by scintillation counting. 

B. 4 hour time point : The digested cells were analyzed by thin- 
10 layer chromatography to determine the content of cholesterol ester in the cells. 

Extracts were spotted onto TLC plates and run for 30 minutes in 2 ml 
hexane: isopropanol (3: 2) mobile phase for 30 minutes, followed by a second run in 
1 ml hexane: isopropanol (3: 2) mobile phase for 15 minutes. 

C. Protein determination of cell extracts . Plates containing a 
1 5 sample of the cell extracts were placed on orbital shaker at 120 rpm for indicated 

times and then extracts are pooled into 12 X 75 tubes. Plates were dried and NaOH 
(2ml/well) added. The protein content of the samples were then determined. Two 

additional 50fil aliquots from all wells were assayed for total protein by the Pierce 

« 

micro BCA method. The quantity of labeled cholesterol observed in the cells was 
20 normalized to the quantity of protein in the cells. 
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Table 4. Total Cholesterol in Transfected CHO Cells in the Presence and 
Absence of 

Ezetimibe. 



Tbta) Ottlesferol, dpm± sem 



V^MHa 1 EZflQuM) 
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28675 ±527 

27285 ±968 











12 Mb) Fata 








(BO astro] 


| 4710 


±246 


4532 


±165 




44208 


±2702 


43391 


±1197 


S&Bl 


16970 


±763 


12349 


±298 




t40105 


±6523 


98956 


±4447 


mNPClLl(C7) 


! 6316 


±85 


6120 


±755 




45133 


±342 


41712 


44054 


iwpau(ca) 


5340 


±12 


4703 


±231 




40018 


±1181 


33985 


±1928 




4831 


£431 


4579 


±257 




37378 


±3461 


34063 


±1619 


iWOLIfCT) 


4726 


±272 


4664 


' ±63 




39100 


±2350 


38581 


±784 











24 Mto Poise 








CHOGffitm 


7367 


±232 


6678 


±215 




65843 


±1281 


61764 


±2131 


SR-BJ 


39166 


±2152 


23558 


±1310 




324126 


±11848 


198725 


±11713 


«JNPC1L1(C7) 


10616 


±121 


9749 


±482 




77222 


±1040 


74041 


±3670 


rNPOLl(C21) 


9940 


±587 


8760 


±293 




76356 


±9618 


66165 


±2181 




8728 


±721 


8192 


±237 




70509 


±5189 


62279 


±1352 


IWC1U{C7^J 


8537 


±148 


7829 


±204 




72134 


±1305 


63482 


±368 , 



EZ = ezetimibe 
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Table 5. Cholesterol Ester in CHO cells in the Presence or Absence of Ezetimibe. 



Cholesteryl Ester, dpm ± rem 
VcMcIe 1 EZOOhM) 



Cfcolesteiyi Estei 



Vehicle 



[ EZflOuM) 



CHOGmtrol 


652 


±13 


208 


±9 


SR-HI 


476(3 
732 

* 


±1292 
±127 




±401 


mNPOLl(C7) 


453 


±118 


rNPOLl(C21) 


2667 


±90 


454 


±33 


nNPOLl(CI7) 


751 


±74 


202 


±10 


riSPOLUCT) 


462 


425 


191 


±54 



5647 ±55 

391067 ±14391 

4994 ±827 

18655 ±1032 

5379 ±481 

3597 ±193 



1902 ±87 

72782 43181 

3057 ±776 

3193 ±265 

1510 ±62 

1496 ±403 



CHO Control 
SR-BI 
mNPClLl(C7) 
rNPOLl(C21) 
rNPOLl(Ci7) 



Free Cholesterol, dpm ±sem 



VcWde 



HrceCholc 



m « * 



r-JiHPliUin'iii -i 



VeUde 



protein ±sem 



4 Hour Pulse 



61612 ±1227 

214678 ±4241 

79628 ±793 

71352 ±1343 

78956 ±3782 

75348 42093 



56792 ±568 

194519 4474 

77516 ±1910 

69106 ±711 

71646 4446 

70628 ±212 



EZ^ezetinribe 



533876 ±17770 

1762873 ±46607 

544661 ±1269 

498016 48171 

566456 429204 

586127 ±13932 



E&QQpM) 



519607 ±16203 
1521341 44185 



5: 



±10386 
485460 44410 
536651 ±7146 
556855 ±7481 
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Table 6. Uptake of labeled cholesterol in the presence of increasing amounts of 
unlabeled cholesterol. 



I 



CkUCholutml 
3|igAaL 

100p«/toL 



Total Chohaterol, dnm ± iem 

_ SR-BI 1 mNPftl f ,1 (C7\ I rNPClLlfCin I I CHO Control I 



I[ Tottl Chftt qtef flL dpm/mfl protein A icm 

I CHQCoptroLl . 8R-P1 I rottpcquctt 1 r? 



11271 *4M 49603 ±248 14230 41628 

162*2*2438 79967*8131 2M65 AJ037 

14738 ±1607 TIMS ASMS 19001 *tS30 

16468 *MM 58183 *4543 15973*1065 



16636 *1233 
13223*4596 
13218*1149 
11560*1153 



MMJoPoba 

108956*5413 



liildWflffll 



Oioterttw l jtod nm * ftp 



3pg/mL 

aOftsymL 

lOOpgftal 



SpgteL 
lOpgtal 
SOosfaL 

loo^eteL 



SltgAaL 

l0pg/mL 
JOpgftnL 
lOOpftaL 



2737 MM 
1646 *76 
970*45 
893*155 



39596 *I241 
17292*343 
6643 *I53 
4777*27 



541562 U3785 



94945 *12916 



140764*14435 

15 1283 ±13345 880224 482354 230983 *274ll 
133109 112106 796236*11952 160456*12112 llt522*694l 
149559*17977 630143*5711 147717*3261 101328*7191 
Chotatwrl Eiter. dpm/koa protein A tarn 



cniOMiBrvl Kiter. flpmrtoa protein A wm 
CHO Control I 5R-BI I mOTClLlfCn I rNPClLl 



1561 *l 
998*36 
537*12 
403*7 



4015*47 
1866*35 
970*9 
777*16 



4 Hoot Polie 

22050*975 



Fret CHolertcroi. dnm ± tan 



CHO Control I SR-BI 1 mNPClLUCTl rNPC3Ll/C21i 



13325*506 
7637*325 
7135 *1239 



382641 *5955 
157914 *3400 
63347*1760 
45088*1526 



13684 *217 
8917*467 
4885*748 
3663*58 



32020*64) 
14849*127 
7741 *100 
6003*196 



Prea Cbolettero^ tom/ag protein A ran 
CHO Control I SR-BI I mNFClLUCT I rNFClLHCSH 



89013 *3724 21 1783 *326B 104343 *2112 
136396 *»66 278216*10901 196173*4721 123144*877 
131745*2922 224429*2536 149172*19689 117143*4976 

79336 *4011 231470 *4221 114399 *2803 93538*1551 

» 

Chotnfcrvl Biter, dpro A tern 



4) Hour Pute 

717508*34130 2047693*16213 914107*5869 733498*11209 

1105118*76074 2540130*92471 1753072*16575 996824*27550 

1036195 *21I42 2149315*78061 1337136 *] 80264 934772*43202 

632963*29756 2182022 436793 1033979*50329 723223*21694 



CHO Control I SR-BI 



Chqtateryl Erter, dprnftng, protdn A «m 



, !£ 

CHO QmtrplT Slfcgl FmNPClLlfCT) 



57373*2704 162296*1644 

33730*1296 112815**373 

19193 *100 58668 *I413 

16761 *398 31280*1270 



22986*940 
14836*552 
8878 *955 
8784*946 



59377 *953 
31797*525 
18963 *380 
14933 *3U 



24 Hour Poise 

357629*14639 1248900*16565 



3p*/aL 
lOpgtaL 

tOOpg/mL 



Frta Cnolettcrol. rfom A rem 
8R-B1 ImWPCILIfCTll rNP ClLKCSl) 



215004 *5942 
122071 *127l 
103235 *1739 



830231 *12764 

446581*7472 

272796*15392 



160328 *6565 
9S594 *4205 
59091 42697 
60670*4597 



401315 *5557 
200451*5239 
119728*2131 
96315 *1033 



, Pre» CholCTterol (fomMw protein * icm 

CHO Control i SR-BI I mWPCIUfCn \ rNPCILIfClII 



248983*4207 357819*4519 283610*5187 227244*1016 

231208*1927 269822*5872 311777*8227 231666*6191 

203566*6008 225273*5932 279604*6612 209372 *33B6 

178424*2)79 167082*2211 229832*4199 182678*7709 



MHaerFaHe 

1552637*18954 2752957*24984 1993)56 *56968 1536023*10304 



1477414 ±15864 1984473*18420 2065980*25517 1461137 *58517 
1294878*41819 1716066*52581 1859476*29507 1321730*5452 
1099648*25160 1455799 *9585 1599244*78938 1177546*51191 



Example 20: Labeled Cholesterol Uptake Assay. 

In this example, the ability of CHO cells transiently transfected with 
rat NPC1L1 or mouse SR-BI to take up labeled cholesterol Jvas evaluated. Also 
evaluated was the ability of rat NPC1L1 to potentiate the ability of CHO cells 
transfected with mouse SR-BI to take up labeled cholesterol. In these assays, 
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cholesterol uptake, at a single concentration, was evaluated in pulse-chase 
experiments. The data generated in these experiments are set forth, below, in Table 7. 

Cells: 

A. CHO background cells (mock transfection). 
5 B. CHO cells transiently transfected with mouse SR-B 1 . 

C. CHO transiently transfected with rat NPC1L1 cDNAs 
(n=8 clones). 

Transiently transfected cells were seeded at 300,000 cells / well (mL) 
in 12-well plates. 

10 Procedure: 

All reagents and culture plates were maintained at 37°C unless 
otherwise noted. 

Starve. The maintenance media (F 12 HAMS, l%Pen/Strep, 10%FCS) 
was removed from the cells and replaced with 1 mL "starve" media (F12 HAMS, 
1 5 Pen/Strep, 5% lipoprotein deficient serum (LPDS). Cells were starved for 1 hour. 

^-Cholesterol Pulse. The following was added directly to each well. 
O.SjiCi ^-cholesterol (-1.1 X 106 dpm/well) in 50^1 of a 
mixed bile salt micelle. 

4.8mM sodium taurocholate (2.581mg/mL) 
20 0.6 xnM sodium oleate (0. 1 83 mg/mL) 

0.25 mM cholesterol (0.1 mg/mL) 
Dispersed in "starve" media by ultrasonic vibration 
Final media cholesterol concentration = Sjxg/mL 
Labeled cholesterol pulse time points were 24 Min and 4 hours. 
25 Triplicate wells for each treatment. 

Wash. At the designated times, media was aspirated and cells were 
washed once with Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% BSA, pH 7.4) and 
once with Hobbs Buffer B (50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37°C. 

Processing/Analysis. 

30 A. 24 minute time point : Cells were digested overnigjht with 

0.2N NaOH, 2mL/well at room temp. One 1 .5 mL aliquot was removed from each 
well, neutralized & counted for radioactivity by scintillation counting. 
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B. 4 hour time point : The digested cells were analyzed by thin- 
layer chromatography to determine the content of cholesterol ester in the cells. 

The extracts were spotted onto thin layer chromatography plates and 
run in 2 ml hexane: isopropanol (3: 2) containing mobile phase for 30 minutes, 
followed by a second run in 1 ml hexane: isopropanol (3: 2) containing mobile phase 
for 15 min. 

C. Protein determination of cell extracts : Plates containing a 
sample of the cell extracts were placed on orbital shaker at 120 rpm for indicated 
times and then extracts are pooled into 12X75 tubes. Plates were dried and NaOH 
(2ml/well) added. The protein content of the samples were then determined. Two 
additional 50^1 aliquots from all wells were assayed for total protein by the Pierce 
micro BCA method. The quantity of labeled cholesterol observed in the cells was 
normalized to the quantity of protein in the cells. 
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Table 7. Labeled cholesterol uptake in transiently transfected CHO cells. 





Total Cholesterol, ±sem 




dpm 




dpnVrag protein 


Transfection 


24 IVfin Pulse 


CHO Control (mock) 


4721 ±436 




49024 ±4328 


SR-BI(Transient) 


5842 ±82 




59445 ±1099 


NPC1L1 (Transient) 


4092 ±377 




47026 ±2658 


SR-BI/NPCIL1 (trans) 


3833 ±158 




52132 ±3071 




Cholesteryl E 


ister,±sem 


* 


dpm 


! j dpm/ms protein 




4 Hour Poise 


CHO Control (mock) 


2132 ±40 




20497 ±640 


SR-BI(Transient) 


5918 ±237 




51812 ±1417 


NPC1L1 (Transient) 


1944 ±93 




19788 ±642 


SR-BI/NPC1L1 (trans) 


4747 ±39 




58603 ±1156 




Free Cholesterol, ± sem 




dpm 1 I dpnVme protein 




4 Hour Pulse 


CHO Control (mock) 


45729 ±328 




439346 ±5389 


SR-BI(Translent) 


50820 ±2369 




444551 ±9785 


NPC1L1 (Transient) 


39913 ±1211 




406615 ±6820 


SR-BI/NPC1L1 (trass) 


37269 ±1225 




459509 ±6195 



Example 21: Expression of rat, mouse and human NPC1L1. 

5 In this example, NPC1L1 was introduced into cells and expressed. 

Species specific NPC1L1 expression constructs were cloned into the plasmid 
pCDNA3 using clone specific PCR primers to generate the ORF flanked by 
appropriate restriction sites compatible with the polylinker of the vector. For all three 
species of NPC1L1, small intestine total tissue RNA was used as a template for 
1 0 reverse transcriptase-polymerase chain reaction (RT-PCR) using oligo dT as the 
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template primer. The rat NPC1L1 was cloned as an EcoRI fragment, human NPC1L1 
was cloned as a Xbal/NotI fragment and mouse NPC1L1 was cloned as an EcoRI 
fragment. Forward and reverse strand sequencing of each clone was performed to 
confirm sequence integrity. Standard transient transfection procedures were used 
5 with CHO cells. In a 6-well plate CHO cells were plated 1 day before transfection at 
a plating density of 2 X 10 s cells/well. The following day, cells were incubated with 
2 p.g plasmid DNA and 6 |iL Lipofectamine for 5 hours followed a fresh media 
change. Forty-eight hours later, cells were analyzed for NPC1L1 expression using 
anti-NPCl LI antisera by either FACS or western blot. To establish stable long term 

10 cell lines expressing NPC1L1, transfected CHO cells were selected in the presence of 
geneticin (G418, 0.8 mg/ml) as recommended by the manufacturer (life 
Technologies). Following one month of selection in culture, the cell population was 
stained with anti-NPClLl antisera and sorted by FACS. Individual positive staining 
cells were cloned after isolation by limiting dilution and then maintained in selective 

1 5 media containing geneticin (0.5 mg/ ml). 

Other cell types less susceptible to transfection procedures have been 
generated using adenoviral vector systems. This system used to express NPC1L1 is 
derived from Ad 5, a type C adenovirus. This recombinant replication-defective 
adenoviral vector is made defective through modifications of the El, E2 and E4 

20 regions. The vector also has additional modifications to the E3 region generally 
affecting the E3b region genes RIDa and RTDb. NPC1L1 expression was driven 
using the CMV promoter as an expression cassette substituted in the E3 region of the 
adenovirus. Rat and mouse NPC1 LI were amplified using clone specific primers 
flanked by restriction sites compatible with the adenovirus vector Adenovirus 

25 infective particles were produced from 293-D22 cells in titers of 5 X 1 0 10 P/mL. 

Viral lysates were used to infect cells resistant to standard transfection methodologies. 
In Caco2 cells, which are highly resistant to heterologous protein expression, 
adenovirus mediated expression of NPC1L1 has been shown by western blot analysis 
to persist at least 21 days post-infection. 

30 Example 22: NPC1L1 Knock-Out Transgenic Mouse. 

NPC1L1 knockout mice were constructed via targeted mutagenesis. 
This methodology utilized a targeting construct designed to delete a specific region of 
the mouse NPC1L1 gene. During the targeting process the E. coli lacZ reporter gene 
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was inserted under the control of the endogenous NPC1L1 promoter. The region in 
NPC1LI (SEQ ID NO: 45) being deleted is from nucleotide 790 to nucleotide 998. 
The targeting vector contains the LacZ-Neo cassette flanked by 1.9 kb 5' arm ending 
with nucleotide 789 and a 3.2 kb 3* arm starting with nucleotide 999. Genomic DNA 
5 from the recombinant embryonic stem cell line was assayed for homologous 

recombination using PCR. Amplified DNA fragments were visualized by agarose gel 
electrophoresis. The test PCRs employed a gene specific primer, which lies outside 
of and adjacent to the targeting vector arm, paired with one of three primers specific 
to the LacZ-Neo cassette sequence. For 5'. PCR reconfirmation, the NPCIL1 specific 
10 oligonucleotide ATGTTAGGTGAGTCTGAACCTACCC (SEQ ID NO: 46) and for 
3 'PCR reconfirmation the NPC1L1 specific oligonucleotide 

GGATTGCATTTCCTTCAAGAAAGCC (SEQ ID NO: 47) were used. Genotyping 
of the F2 mice was performed by multiplex PCR using the NPC1L1 specific forward 
primer TATGGCTCTGCCCTCTGCAATGCTC (SEQ ID NO: 48) the LacZ-Neo 

1 5 cassette specific forward primer TCAGCAGCCTCTGTTCCAC ATACACTTC (SEQ 
ID NO: 49) in combination with the NPC1L1 gene specific reverse primer 
GTTCCACAGGGTCTGTGGTGAGTTC (SEQ ID NO: 50) allowed for 
determination of both the targeted and endogenous alleles. Analysis of the PCR 
products by agarose gel electrophoresis distinguished the wild-type, heterozygote and 

20 homozygote null mouse from each other. 

Example 23: Acute Cholesterol Absorption in NPCILI-Deficient Mice. 

To determine whether NPC1 LI plays a role in cholesterol absorption, 
JNPCJLJ deficient mice were studied. 

Mice deficient in NPC1L1 (-/-) were generated by breeding 

25 heterozygote mice (+/) to obtain wild-type (+/+) and NPC1L1 deficient mice (-/-). 
Non-fasted mice (6.5-9 weeks old, mixed 129 and C57BL/6 background) were 
weighed and grouped (n=2 -A and n=4 +/+). All animals were gavaged (Feeding 
needles, 24G x 1 inch, Popper and Sons, NY) with 0.1 ml corn oil (Sigma; St Louis, 
MO) containing ljaCi 14 C-cholesterol (New England Nuclear, [ 4 " 14 C] Cholesterol, 

30 NEC-018) and O.lmg carrier cholesterol mass (Sigma; St. Louis, MO). Two hours 
later, blood was collected by heart puncture. The liver was removed, weighed, and 
three samples were placed into 20 ml counting vials. Tissues were digested in 1 ml of 
INNaOH at 60°C overnight The tissue digests were acidified by addition of 250|il 
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of 4N HC1 prior to liquid scintillation counting (LSC). Plasma was isolated by 
centrifogation at 10,000 rpm for 5 minutes in a microfuge and duplicate 100^1 
aliquots of plasma were taken for LSC. 

Cholesterol absorption, evaluated by this acute technique and 
5 expressed as the total amount of radioactive cholesterol in the plasma and liver, 
demonstrated that the wild type mice (+/+) absorbed an average of 1 1 ,773 dpm and 
NPC1L1 deficient mice absorbed 992 dpm of the 14C-cholesterol. These results 
indicate that the NPC1L1 deficient mice have a 92% reduction in cholesterol 
absorption. These data confirm the role of NPC1L1 in intestinal cholesterol 
10 absorption. Inhibition of NPClLl-mediated cholesterol absorption, in a subject, by 
administering NPC1L1 antagonists, such as ezetimibe, to the subject, are a useful way 
to reduce serum cholesterol levels and the occurrence of atherosclerosis in the subject 

Example 24: Cholesterol Absorption in NPC1L1 (NPC3) 
Knockout Mice (Fecal Ratio Method: Cholesterol/Sitostanol). 

15 In this example, cholesterol absorption and the activity of ezetimibe 

was determined in the NPC1L1 knockout mice (-/-), heterozygous mice (+/-), and age 
matched wild-type mice (4-/+). 

Cholesterol absorption in the mice was determined by the dual fecal 
isotope ratio method as described by Altmann et al. (Biochim. Biophys. Acta. 

20 1 580(1): 77-93 (2002)). Mice (n= 4-6/group) were fed a standard rodent chow diet 
and in some groups treated daily with a maximally effective dose of ezetimibe (10 
mg/kg). Mice were gavaged with I4 C-cholesterol (l|iCi, O.lmg unlabeled cholesterol) 
and 3 H-sitostaaol (2jiCi) in 0.1 ml corn oil. Feces were collected for 2 days and fecal 
14 C-cholesterol and 3 H-sitostanol levels were determined by combustion in a Packard 

25 Oxidizer. The fraction of cholesterol absorbed, as evaluated by the fecal dual isotope 
technique, was similar in wild type (+/+) and heterozygous mice (+/-) fed a chow diet 
(heterozygous mice absorbed 46 ±5% and age matched wild type mice absorbed 51 
±3% of the dose of 14 C-cholesterol). The NPC1L1 knockout mice (-/-) absorbed 15.6 
±0.4% of the I4 C-cholesterol, which was similar to the wild type mice treated with a 

30 maximally effective dose of ezetimibe (1 6. 1 ±0.3%), and reduced by 69% compared 
to wild type mice (pO.OOl). In NPC1L1 knockout treated with ezetimibe at 10 
mg/kg/day, cholesterol absorption was similar to that seen in the untreated knockout 
mice (16.2 ±0.6% compared to 15.6% ±0.4%, respectively). Thus, the majority of 
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cholesterol absorption is dependent on the presence of NPC1L1 and the residual 
cholesterol absorption in mice lacking NPC1L1 is insensitive to ezetimibe treatment. 
These results indicate that NPC1L1 is involved in the small intestinal enterocyte 
uptake and absorption of cholesterol and is in the ezetimibe sensitive pathway. 

5 Example 25: Mouse Screening Assay (Acute Cholesterol Absorption). 

The following screening assay is used to identify the presence of an 
NPC1L1 antagonist in a sample. 1 

Mice deficient in NPC1L1 (-/-) are generated by breeding heterozygote 
mice (+/) to obtain wild-type (+/+) and NPC1L1 deficient mice (-/-). 
10 In a first set of experiments, non-fasted mice (6.5-9 weeks old, mixed 

129 and C57BL/6 background) are weighed and grouped (n=l to 4 and n=l to 4 
+/+). All animals are gavaged (Feeding needles, 24G x 1 inch, Popper and Sons, NY) 
with 0.1 ml corn oil (Sigma; St. Louis, MO) containing l\xd 14 C-cholesterol (New 
England Nuclear, f^C] Cholesterol, NEC-018) and O.lmg carrier cholesterol mass 
1 5 (Sigma; St. Louis, MO). 

La another set of experiments, 1 to 4 wild-type NPC1L1 mice (+/+) are 
treated identically to the mice in the first set of experiments, above, except that the 
mice are additionally fed a sample to be tested for the presence of an NPC1L1 
antagonist. 

20 Two hours later, blood is collected from each mouse by heart puncture. 

The liver is removed, weighed, and three samples are placed into 20 ml counting 
vials. Tissues are digested in 1 ml of IN NaOH at 60°C overnight The tissue digests 
are acidified by addition of 250pl of 4N HC1 prior to liquid scintillation counting 
(LSC). Plasma is isolated by centrifugation at 10,000 rpm for 5 minutes in a 

25 microfuge and duplicate 100jil aliquots of plasma are taken for LSC. 

Cholesterol absorption, evaluated by this acute technique is expressed 
as the total amount of radioactive cholesterol in the plasma and liver. The sample 
tested is determined to contain an NPC1L1 antagonist when the level of cholesterol 
absorption (as measured by the above described methods) in the wild-type NPC1L1 

30 mouse (+/+ ) which was fed the sample and in the NPC1L1 deficient mouse (-/-) are 
less than the amount of cholesterol absorption in the wild-type NPCJLJ mouse (+/+) 
which was not fed the sample. 
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Example 26: Mouse Screening Assay 

(Fecal Ratio Method: Cholesterol/Sitostanol). 

The following screening assay is used to identify the presence of an 
NPCILI antagonist in a sample. 
5 Cholesterol absorption in the mice is determined by the dual fecal 

isotope ratio method as described by Altmann et ah (Biochim. Biophys. Acta. 
1580(1): 77-93 (2002)). 

Three groups of mice (n=l-6/group) are assembled. Two separate 
groups comprise wild-type NPC1L1 mice (+/+) and one group comprises NPCILI 
1 0 deficient mice (-/-). 

Each group is fed a standard rodent chow diet and in some groups 
treated daily. Mice are gavaged with 14 C-cholesterol (ljiCi, O.lmg unlabeled 
cholesterol) and 3 H-sitostanol (2jiCi) in 0.1ml corn oil. One group of mice, which 
comprise wild-type NPCILI mice (+/+) are further fed a sample to be tested for the 
15 presence of an NPCILI antagonist. Feces are collected for 2 days and fecal 14 C- 
cholesterol and 3 H-sitostanol levels are determined by combustion in a Packard 
Oxidizer. 

The sample tested is determined to contain an NPCILI antagonist 
when the level of cholesterol and/or sitostanol absorption (as measured by the above 
20 described methods) in the wild-type NPCILI mouse (+/+) which was fed the sample 
and in the NPCILI deficient mouse (-/-) are less than the amount of cholesterol and/or 
sitostanol absorption in the wild-type NPCILI mouse (+/+) which was not fed the 
sample. 

Example 27: Binding Analysis Using Brush Border Membrane Vesicles 
25 The following screening assay may be used to identify the presence of 

an NPCILI ligand in a sample. 

Materials. The following two compounds were synthesized for the 
binding assay described herein , 3 H-ezetimibe glucuronide 1 (34.5 Ci/mmol) and its 
35 S-propargyl-sulfonamide derivative 2 (800-1100 Ci/mmol). 

30 
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S-propargyl-sulfonamide 
ezetimibe-glucuronide 
2 

Synthesis of ezetimibe glucuronide and S-propargyl-sulfonamide 
ezetimibe-glucuronide. Ezetimibe gliacuronide (compound 1) (also referred to as 
5 EZE-glucurorride) can be made according to the procedures in U.S. Patent No. 
5,756,470. The general scheme below illustrates a method for the synthesis of 
compound 2 and radiolabel led 35 S-2. 



Ezetimibe-glucuronide 
1 
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H 




NHS0 2 M« -f 



1 



DMP 




*v NHS0 2 CH s 



EtjN 



CH,OH, H 2 0 



9H 




NHSOjCH, 
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Preparation of compound 33 S-2 fCompound 2 with radiolabeled 35 S) 
Step A: Preparation of [^S]N-prop-2-yn-l-ylmethanesulfonamide (i). 
The appropriate volume of [35s]methane sulfonyl chloride (see Dean, D.C.; et al., 
J. Med. Chem. 1996, JP, 1767) totaling 3.5 mCi was removed from a stock solution 
5 in methylene chloride and placed in a 5mL conical flask. It was then distilled at 
atmospheric pressure until the volume was approximately 50 \iL. To this solution 
was immediately added 50 pJL of propargylamine. After 1 5 min ; the reaction mixture 
was diluted with 10 mL of ethyl acetate, washed with saturated sodium bicarbonate 
solution (3x2 mL), and dried over sodium sulfate. After filtration the resulting 
1 0 solution had a count of 3 .3 mCi and a radiochemical purity of 99.9 % by HPLC 

(Zorbax XDB C8 column, 4.6 x 150 mm, 5 % acetonitrile:H 2 0 (0.1 % TFA) to 100 % 
acetonitrile, 1 5 min linear gradient, 1 mL/min, t K = 4.4 min). 

StepB: Pr^aration of [35s] 4-[(2^,3^)-3-[(3iS)-3-(4-fluorophenyl>-3- 
hydroxypropyl]- 1 -(4- {3-[(methylsulfonyl)amino]prop- 1 -yn- 1 -yl}phenyl)-4- 

15 oxoazetidin-2-yl]phenyl methyl P-D-glucopyranosiduronate ([35s]) (iii). Dissolved 
3.0 mCi of [3 5 S] AT-prop-2-yn- 1 -ylmethanesulfonamide, 1 mg of compound ii 
(prepared according to Burnett, D.S. et al., Bioorg. Med. Chem. Lett. (2002), vol. 12, 
p. 3 1 1), and 1 |iL of triethylamine in 100 uL of dimethylfonnamide inside a plastic 
microcentrifuge tube. To this solution was added lO jiL of a stock solution containing 

20 8. 1 mg of teh^s(triphenylphosphine)palladium(0) and 1 .4mg of copper iodide in 1 
mL of dimethylfonnamide. Stirred at room temperature for sixty hours at which time 
HPLC indicated 55% conversion. This reaction mixture, which had a radiochemical 
purity of 44.4% by HPLC (Zorbax XDB C8 column, 4.6 x 1 50 mm, 5 % 
acetonitrile:H 2 0 (0.1 % TFA) to 100 % acetonitrile, 15 min linear gradient, 1 

25 mlVmin, *r = 9.3 min) was taken on directly to the next step. 

Step C: Preparation of [35s] 4-[(2S,3R)-3-[(3S)-3-(4-fluorophenyl>3- 
hydroxypropylH -(4- {3-[(methylsulfonyl)amino]prop- 1 -yn-l-yl}phenyl)-4- 
oxoazetidin-2-yl]phenyl p-D-glucopyranosiduronic acid 35 S-2. The crude reaction 
mixture containing compound iii was treated with 25 jiL of methanol, 90 uL of water, 
30 and 30 \xL of triethylamine and stirred at room temperature for one hour at which time 
it was concentrated to near dryness under a slow stream of nitrogen. The residue was 
dissolved in 1 :1 acetonitrile:H20 and subjected to semi-preparative chromatography 
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(Zbrbax XDB C8 250 x 9.4 mm column, 70:30 acetonitrile:H 2 0 (0.1 % TFA) 4 
rnL/min, 1 x 0.2 mL injections). 540 |iCi of product was obtained which had a 
radiochemical purity of 99.9% by HPLC (Zorbax XDB C8 column, 4.6 x 150 mm, 
70:30 acetonitrile:H 2 0 (0.1 % TFA), 1 mL/min, / R = 10.4 min) and coeluted with an 
5. authentic sample of compound 2. LC/MS m/z = 508 (product - glucuronide - H 2 0), 
SA = 769 Ci/mmol. 

Alternate Preparation of 35 S-2. 

Step A: Preparation of fiL The appropriate volume of [ 35 S]methane 
sulfonyl chloride (see Dean, D.C.; et al., J. Med. Chem. 1996, 3P, 1767) totaling 1.3 
10 mCi was removed from a stock solution in methylene chloride and placed in a 5mL 
conical flask. It was then distilled at atmospheric pressure until the volume was 
approximately 50 \xL. To this solution was immediately added a solution of 1 mg of v 
in 5 nL of pyridine (freshly distilled over calcium hydride). 




The solution was stirred at room temperature for five minutes at which 



time it was concentrated to near dryness under a slow stream of nitrogen. This 
reaction mixture, which had a radiochemical purity of 80. 1% by HPLC (Zorbax XDB 
C8 column, 4.6 x 150 mm, 5% acetonitrile:H 2 0 (0.1 % TFA) to 100 % acetonitrile, 15 
20 min linear gradient, 1 mL/min, /r = 9.3 min) was taken on directly to the next step. 

Step B:Preparation of 35 S-2. The crude reaction mixture containing iii 
was treated with 25 \xL of methanol, 90 \\L of water, and 30 \xL of triethylamine and 
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stirred at room temperature for one hour at which time it was concentrated to near 
dryness under a slow stream of nitrogen. The residue was dissolved in 1 :1 
acetonitrile:H20 and subjected to semi-preparative chromatography (Zorbax XDB C8 
250 x 9.4 mm column, 70:30 acetonitrile:H 2 0 (0.1 % TFA) 4 mL/min, 1 x 0.2 mL 

5 injections). 350 jiCi of product was obtained which had a radiochemical purity of 98.4 
% by HPLC (Zorbax XDB C8 column, 4.6 x 150 mm, 70:30 acetonitrile:H 2 0 (0.1 % 
TFA), 1 mL/min, *r = 10.4 mm) and coeluted with an authentic sample of 2. LC/MS 
m/z = 508 (product - glucuronide - H 2 0), SA * 9 1 1 Ci/mmoL 

Following the same general procedure for synthesis of 35 S-2, except 

1 0 omitting the radiolabel ling, compounds 2 and jv can be prepared. 

Preparation of brush border membrane vesicles (BBMV). 
Membranes were prepared from Rhesus macaque (Macaca mulatta), rat (male 
Sprague-Dawley), and mouse (male C57BL/6J) intestines, using Mg 4 "* precipitation 
method described in the following references and with modifications described below 

15 (Hauser, H., Howell, K., Dawson, R.M.C., Bowyer, D. E. Biochim. Biophys. Acta 
602, 567-577 (1980); Kramer, W., Girbig, F., Gutjahr, U., Kowalewski, S., Jouvenal, 
IC, Muller, G., Tripier, D., Wess, G. J. Biol. Chem. 268, 18035-18046 C1993); 
Rigtrup, K.M., Ong, D.E. Biochemistry H, 2920-2926 (1992)). 

The intestines from freshly sacrificed animals were cut into segments, 

20 perfused with ice-cold saline buffer (Buffer A: 26 mM NaHC0 3 , 0.96 mM NaH 2 P0 4 , 
5 mM HEPES, 5.5 mM glucose, 1 17 mM NaCl, 5.4 mM KC1, pH = 7.4), placed on 
cold glass plates, opened longitudinally, and the mucosa scraped with glass 
microscope slips. This mucosa could be used fresh or frozen with identical results. 
To prepare the membranes, the mucosal scrapings were resusp ended in 20 volumes of 

25 cold buffer consisting of 300 mM D-mannitol, 5 mM EGTA, 12 mM Tris, pH 7.4 
with HC1, and containing 0.1 mM PMSF and a 1% dilution of a protease inhibitor 
cocktail (set 1, Calbiochem). They were homogenized using a Polytron at medium 
speed on ice until inspection with a microscope indicated complete cell lysis. Then, 
solid MgCk was added slowly with stirring to a final concentration of lO mM, and the 

30 solution was kept stirring on ice for 1 5 min. Cellular debris was removed by 

centrifugation for 15 min at 3,000g, and the membranes were recovered "by 

« 

centrifugation for 60 min at 48,000g. The membranes were further rinsed by re- 
suspension in a buffer containing 50 mM D-mannitol, 5 mM EGTA, and 2 mM Tris 
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at pH 7.40, and centrifiigation for 60 min at -48,000g. The final pellet was 
resuspended in 120 mM NaCl and 20 mM Txis at pH 7.40 to a concentration of -1 0- 
20 mg protein/ml, aliquoted, frozen in liquid nitrogen, and stored at -80C. The 
activity was stable indefinitely and could be freeze-thawed with minimal loss of 
5 activity. 

Membrane protein was measured by the Bradford assay (Bradford, 
M.M. Anal. Biochem. 22, 248-254 (1976)) using bovine serum albumin as standard. 
The enrichment in brush border membranes "was assessed using gamma- 
glutamyltransferase as a marker enzyme (Kramer, W., Girbig, F., Gutjahr, U., 
10 Kowalewski, S., Jouvenal, K., Muller, G. 9 Tripier, D., Wess, G. J. Biol. Chem. 268, 
18035-18046 (1993)), which indicated a 6-fold enrichment over the initial 
homogenate. 

Binding assay. Assays were conducted in 12 x 75 mm glass test tabes 
and total volume 20-100 ul. In general, frozen membranes were diluted in buffer A or 

15 buffer A containing 0.03% taurocholate and 0.05% digitonin to a final concentration 
of 0.02 to 5 mg/ml. Radiolabeled ligands were typically 25-50 nM for ^-ezetimibe 
, (EZE)glucuronide 1, and 3-5 nM for its 35 S analog 2, in the assay, and they were 
delivered as DMSO or CH3CN solutions. Competing ligands were likewise added as 
DMSO solutions to give a total 2-10 % organic solvent content. Nonspecific binding 

20 was defined by competition with 100 uM ezetimibe glucuronide. At least 2 

components of buffer A, the bicarbonate and. phosphate salts, were later found to be 
inconsequential and were routinely omitted. To ensure equilibrium was established, 
the reactions with compound 1 were incubated at least 3 hours for rhesus membranes 
and at least one hour for rat membranes at ro om temperature, and with compound 2 as 

25 long as 2 hours at 37°C with rhesus and rat banish membranes. Additionally, reactions 
with compound 2 were incubated as long as :2 hours at 37°C with membranes from 
HEK-293 cells expressing mouse, rat or huDaan NPC1L1 . 

Bound ligand was quantified "by single-tube vacuum filtration using 
GF/C glass fiber filters. Glass fiber filters (GF/C) were obtained from Whatman. The 

30 filters were pretreated by soaking with 0.5% polyethylenimine to reduce nonspecific 
binding. Filtration was accomplished by adding 2.5 ml of ice cold buffer (120 mM 
NaCl, 0.1% sodium cholate, and 20 mM ME S at pH 6.70) to the assay tube, pouring 
the mixture through the filter, and then rinsing the tube and filter twice more with 
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another 2 x 2.5 ml buffer. The filters were counted in 7 ml vials using Packard DM 
liquid scintillation fluid. Where triplicate assays were performed, the standard error 
was typically <4%. As an example, a 100^1 assay of rat brush border membranes at 2 
mg/ml in the presence of 400,000 dpm (50 nM) 3 H-ezetimibe glucuronide gave 
5 1 5,000 dpm specific and 3,000 dpm nonspecific binding. The filters contributed most 
of the nonspecific binding (2,000 dpm). 

Alternatively, vacuum filtration of compound 2 on a Millipore 96-well 
plate (Whatman GF/C) can also be used to achieve adequate precision. 

Data analysis. Data from saturation experiments were subjected to a 

10 Scat chard analysis, and linear regression was performed to yield the equilibrium 

dissociation constant (Kd) and maximum receptor concentration (Bmax). Correlation 
coefficients for these determinations were typically greater than 0.96. Data from 
competition experiments were analyzed and IC50 values determined from Hill plots of 
the binding data. The kinetic data for ligand association and dissociation were 

15 subjected to the analysis of Weiland and Molinoff (Weiland, G., MolinofF, .B. Life 
Sci. 29, 313-330 (1981)). The dissociation rate constant for (k-i) was determined 
directly for a first order plot of ligand dissociation versus time. The rate of ligand 
association (ki) was determined from the equation ki = kobs([LRe]/([L] [LR]^)) 
where [L] is the concentration of ligand, [LRJ is the concentration of the complex at 

20 equilibrium, [LR] 

max 

is the maximum number of receptors present, and ko^s is the 

* 

slope of the pseudo-first order plot L„ ([LE]o/([LR] c - [LR]t)) versus time. 

Binding analysis. Ezetimibe is rapidly converted to its glucuronide in 
vivo, and this metabolite is thought to be largely if not exclusively responsible for 
inhibition of cholesterol uptake. Accordingly, both 3 H-ezetimibe and its 

25 corresponding glucuronide derivative (1) were prepared and tested for binding to 

intestinal brush border membrane preparations, using a single-tube vacuum filtration 
technique. As a result of the hydrophobic nature of 3 H-ezetimibe, high nonspecific 
binding was observed, precluding its use as a radioligand for the binding assay. 
However, due to the improved physical properties of the glucuronide derivative (1), 

30 specific binding was observed with this radioligand and it was used to assess binding 
in rhesus, rat, and mouse intestinal brush border membranes. 

Rhesus, rat, and mouse intestinal scrapings were homogenized and the 
brush border membranes isolated. Specific binding was observed exclusively in the 
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membrane fraction. Plots of total, nonspecific, and specific bidding to rhesus (Figure 
1) and rat (Figure 2) brush border membranes. Aliquots of rhesus BBMV (83 
jigtessay) or rat BBMV (250^gtessay) were incubated with increasing concentrations 
of 3 H-EZE-glucuronide. Total binding and nonspecific binding determined in the 
5 presence of 10-100 uM EZE-glucuronide are shown. Specific: binding was calculated 
from the difference between total and nonspecific binding. Data were fit by nonlinear 
regression as described above, and the linear Scatchard plot is shown. In rhesus 
membranes, the data correspond to a single binding site with IC<j = 41 nM and a 
concentration of 5.5 pmol/mg membrane protein. The affinity is ~ 10- fold lower in rat 
1 0 membranes (Kd = 540 nM). ^-EZE-glucuronide is not the best ligand for a binding 
assay for the mouse target due to the compounds low affinity ixi mouse membrane. 
These potencies correlate roughly with the sensitivity of these species to ezetimibe 

inhibition of cholesterol uptake. 

Rate constants for binding and dissociation. Ezetimibe-gluciironide 

15 is slow to bind, and forms a relatively long-lived complex with* its receptor. Indeed, 
this was key to detecting the interaction in a traditional filter-binding assay, as 
ligand/receptor interactions with Kd values greater than 100 nM often go 
unrecognized because of the typical fast off-rates of the ligands. Rate constants for 
association (kon) and dissociation (kotd of compound 1 were determined for rat and 

20 rhesus membranes, and used as an alternative method to calculate the dissociation 
constant (Kd) according to the relationship Kd = Wkon. 300M@/assay of rat brush 
border membrane vesicles were incubated with 25nM 3 H-EZE-glucuxonide at room 
temperature for up to three hours for the association kinetic stuxlies. 83 p.g/assay of 
rhesus brush border membrane vesicles were incubated with 2SnM H-EZE- 

25 glucuronide at room temperature for up to five hours for the association kinetic 

studies. Nonspecific binding measured in the presence of IOOjaM EZE-glucuronide 
was subtracted from the total binding to calculate the specific binding shown in 
Figures 3 A and 4A. For the dissociation kinetic study, rat brusli border membrane 
vesicles were incubated with 25nM 3 H-EZE-glucuronide for 2 Iiours at room 

30 temperature and ligand dissociation was initiated by the addition of 1 OOjaM EZE- 
glucuronide. Rhesus brush border membrane vesicles were incubated with 42nM 
3 H-EZE-glucuronide for 4 hours at room temperature and ligand dissociation was 
initiated by the addition of lOOpM EZE-glucuronide. For both rat and rhesus 
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dissociation studies, samples were collected at various times and radiolabel was 
detected. Dissociation curves are shown in Figures 3B (rat) and 4B (rhesus). 

For rat membranes, the rate constant for association is kon = 5,540 M" 1 
s" 1 (compared to 10 8 to 10 9 MT l s' 1 for diffusion controlled encounter), and the rate 
5 constant for dissociation is koff - 2.4 x 10' 3 s' 1 , corresponding to a half-life of 4.7 min. 
The data are shown in Figure 3, where the solid lines are theoretical for these rate 
constants. The Kd value predicted from these rate constants (Kd = IWkon = 440 nM) 
agrees well with that measured at equilibrium (Kd = 540 nM). 

For rhesus membranes, where 3 H-ezetimibe glucuronide is at least 
10 10-fold more potent (as described above), the association rate remains the same 

but the half-life for dissociation of the complex increases to —90 min. These data are 
shown in Figure 4, where the theoretical lines correspond to kon 53 3,900 M" 1 s* 1 and 
koff — 1.23 x 10" 4 s" 1 , and predict Kd— 32 nM compared to that measured at 
equilibrium (Kd = 41 nM). 

1 5 Example 28: Binding Analysis of a Potent NPC1L1 ligand 

A 35 S-labeled propargyl-sulfonamide analogue of ezetimibe 
glucuronide ( 35 S-2) was identified as a potential NPC1 LI antagonist. Compound 2 
was prepared as described in Example 27 and found to have markedly improved 
affinity for some species of brush border membranes vesicles. For rhesus brush 

20 border membranes vesicles, 56 ng protein/assay were incubated with 25 nM 3 H-EZE- 
glucuronide in the presence of increasing concentrations of EZE-glucuronide and 2. 
For rat brush border membranes vesicles, 1 50 jig protein/assay were incubated with 
50 nM 3 H-EZE-glucuronide in the presence of increasing concentrations of EZE- 
glucuronide and 2. For mouse brush border membranes vesicles, 20 jig protein/assay 

25 were incubated with 3 nM S-2 in the presence of increasing concentrations of EZE- 
glucuronide and 2. 

2 is more potent against enterocyte brush border membrane 
preparations from rats (35-fold), but is equipotent with ezetimibe glucuronide for 
rhesus membrane preparations (Figure 5, Table 8). It also has enhanced affinity for 

30 mouse membranes (Figure 6, Table 8). 
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Table 8. Summary of inhibition constants (Ki) for binding of ezetimibe 
glncuronide 1 and its proparjgyl-snlfonamide derivative 2 to rhesus, rat, and 
mouse intestinal brush border membranes. 



Compound 


Rhesus 


Rat 


Mouse 


I 


39 


530 


2,300 


2 


38 


15 


144 



5 IQ values are nM. 



Example 29 : Distribution of 3 H-eze timibe glucur onide (1) 
binding to intestinal tissues. 

Previous studies have established that cholesterol absorption occurs 

primarily in the jejunum, and is substantially lower in the ileum and duodenum. To 

10 determine if the binding activity is similarly distributed, the binding assay using 3 H- 
ezetimibe glucuronide ( 3 H-1) as a radioligand was used to determine the distribution 
of binding sites in sections from rhesus and rat intestines. 

For the rhesus studies, 1 0 cm corresponding to the ileum of a rhesus 
small intestine was separated and the remaining intestine was divided into three 

15 segments, (proximal, middle and distal) of equal length (70 cm each). For the rat 
studies, 10 cm corresponding to the ileum of a rhesus small intestine was separated 
and the remaining intestine was divided into three segments, (proximal, middle and 
distal) of equal length (36 cm each). Brush border membrane vesicles were prepared 
as described in Example 27. Aliquots of vesicles (100-200^ig) protein/assay were 

20 incubated with 50nM 3 H-EZE-glucuronide in the absence and presence of IOOjjM 
EZE-glucuronide. 

As shown in Figure 7, specific binding for 3 H-ezetiniibe glucuronide 
peaks in the jejunum in both species, consistent with the previously observed pattern 
of cholesterol absorption. 

25 Example 30: Correlation of in vitro and in vivo binding activity of NPC1L1 

To determine if in vitro binding activity is predictive of in vivo 
efficacy, the enantiomer of ezetimibe glucuronide and several close structural 
analogues of ezetimibe glucuronide that were tested in the rat membrane binding 
assay were tested in an acute rat cholesterol absorption study as described in 

30 Examples 23-26. The selected analogs had a range of in vitro potencies, and were 
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anticipated to have similar physical properties to ezetimibe gjucuronide (Tables 9 and 
1 2). The enantiomer, which has a Kd > 1 00,000 nM for the rat target, was inactive in 
the in vivo assay. For the other analogs, the same rank order of potency is observed in 
the in vitro and in vivo assays, further evidence that the observed binding is due to the 
5 target of ezetimibe. 



Table 9. IC50 values of EZE-gluc and analogs to inhibit binding of 3H-EZE-gluc 
to rat brash border membrane vesicles. 



Compound Name 


R* 


X 


Y 


RAT ICso (nM) 


3 


H 


H2 


F 


2,300 


EZE-gluc 1 


H 


(S)-OH 


F 


530 


EZE-gluc enantiomer 4 


. H 


(R)-OH 


F 


>100000 


5 


H 


(R)-OH 


F 


3,900 


6 


H 


=o 


F 


70,000 


7 


OH 


(S)-OH 


F 


252 




Y 

10 

compounds 1, 3, 5, 6 and 7. 
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Backbone structure for compound 4. 

Example 31: Binding affinities of ezetimibe glucuronide and its analogs to 
recombinant NPC1L1 

NPC1L1 was identified as a candidate target of ezetimibe from a 
5 search of genetic databases for cholesterol binding motifs. Subsequently, NPC1L1 
deficient mice were found to have 80% reduction of cholesterol absorption, and did 
not respond to ezetimibe treatment, strongly suggesting that this protein is required 
for ezetimibe efficacy. To determine if NPC1L1 is the direct target of ezetimibe, 
binding affinities were compared for ezetimibe glucuronide and several analogs to 
1 0 NPC 1 LI transfected cells and rat brush border membrane vesicles. 

RatNPClLl transfected CHO cells (-500,000 cells/assay) were 
incubated with 5 nM 35 S-2 (-1 million dpm/assay) for 2 hours at 37 °C in the absence 
or presence of increasing concentrations of EZE-glucuronide (compound 1), 
compounds 2, 3, 5, 6, or 8. Compound 8 is an analog of compound 2 wherein the 
1 5 hydroxyl group in the 3-hydroxylpropyl moiety of 2 is replaced with an oxo group. 

Human NPC1L1 transfected CHO cells (-600,000 cells/assay) were 
incubated with 5 nM 35 S-2 (~1 million dpm/assay) in buffer A for 2 hours at 37 °C in 
the absence or presence of increasing concentrations of EZE-glucuronide (compound 
1), compounds 2, 3, 5, 6, or 8. 
20 As shown in Figures 9 and 12, and Table 10, the affinities for the 

recombinant and native proteins are virtually identical, providing compelling evidence 
that NPC 1 LI is the direct target of ezetimibe in mammalian tissues, and that other 
proteins are not required for binding. 

Affinities of ezetimibe glucuronide and analogues thereof were also 
25 determined for human recombinant NPC1 LI . The results, shown in Figure 9, indicate 
(hat ezetimibe gjucuronide (1) has an affinity for the human protein of 907 nM. The 
propargyl-sulfonamide analogue (2) is approximately 50-fold more potent, with a 
K<j = 21 nM, suggesting that this compound has the potential for enhanced potency of 
cholesterol absorption inhibition in man. 
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Table 10: Comparison of inhibition constants (Ki) for binding to native rat 
intestinal brush border membranes and membranes from rat NPC1L1 
transfected cells. 



Analog 


Recombinant rat NPC1L1 
Ki,nM 


RatBBMV 
Ki, nM 


EZE-glucuronide 1 


790 


600 


2 


12 


15 


3 


2400 


2300 


6 


84500 


70000 


5 


5800 


3900 


8 


556 


818 



5 Example 32: Binding of to membranes from wild type and NPC1L1 
deficient mice. 

Final confirmation that NPC1L1 is the target of ezetimibe was 
provided by binding studies with 35 S-2 in intestinal brush border membranes from 
NPC1L1 deficient and control mice. 

10 Brush border membranes vesicles were prepared from intestinal tissues 

of wild type and NPC1L1 knockout (-/-) mice. 15,30 and 60|ig protein/ assay of 
brush border membranes vesicles were incubated with 4nM 35 S-2 in buffer A for 
3 hours at 37°C in the presence and absence of 100|iM EZE-glucuronide. 

The results, shown in Figure 10, indicate that no detectable binding is 

1 5 observed in membranes from NPC1 LI deficient mice, whereas age matched wild type 
control membranes have detectable binding. The binding affinity observed in this 
experiment in control mouse membranes (Kd=156 nM) was virtually identical to that 
observed in previous studies (Figure 11). 

Example 33: Binding Analysis Using Brush Border Membrane Vesicles from 
20 Rat Mouse and Rhesus Monkey 

Binding studies were performed to compare the relative binding 
affinity of ezetimibe glucuronide to various brush border membrane vesicles. 
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H-ezetknibe glucuronide 1 was prepared as described in Example 27. 
The brush border membranes were prepared as described in Example 27. 

Binding Assay. Assays were conducted in 12 x 75 mm glass test tubes 
and total volume 20-1 00 jil. In general, frozen membranes were diluted in buffer A or 
5 buffer A containing 0.03% taurocholate and 0.05% digitonin to a final concentration 
of 0.5 to 5 mg/vol (Buffer A: 26 mM NaHC0 3 , 0.96 mM NaHJO* 5 mM HEPES, 
5.5 mM glucose, 117 mM NaCl, 5.4 mM KC1, pH = 7.4). Final concentrations of 
[ 3 H]ezetimibe glucuronide 1 were typically 25-50 nM , and were delivered as DMSO 
or CH 3 CN solutions. Competing ligands were likewise added as DMSO solutions to 

10 give a total 1-5 % organic solvent content Nonspecific binding was defined by 

competition with 100-500 [iM ezetimibe glucuronide. At least three components of 
buffer A, the bicarbonate and phosphate salts, and glucose, were later found to be 
inconsequential and were routinely omitted. Reactions were incubated until 
equilibrium was achieved (one hour for rat or three hours for rhesus membranes). 

1 5 Bound ligand was recovered by single-tube vacuum filtration on 

Whatman GF/C glass fiber filters. The filters were pretreated by soaking with 0.5% 
polyethylenimine to reduce nonspecific binding. Filtration was accomplished by 
adding 2.5 ml of ice cold buffer (120 mM NaCl, 0.1 % sodium cholate, and 20 mM 
MES at pH 6.7) to the assay tube, pouring the mixture through the filter, and then 

20 rinsing the tube and filter twice more with another 2 x 2.5 ml buffer. The filters were 
counted in 7 ml vials using Ultima Gold MV liquid scintillation fluid from Packard. 
Where triplicate assays were performed, the standard error was typically <4%. As an 
example, a 1 OOjii assay of rat brush border membranes at 2 mg/ml in the presence of 
400,000 dpm (50 nM) [ 3 H]ezetimibe glucuronide gave 15,000 dpm specific and 3,000 

25 dpm nonspecific binding. The filters contributed most of the nonspecific binding 
(2,000 dpm). 

Data Analysis. After correction for nonspecific binding, saturation- 
binding data were fit by nonlinear regression (Sigma Plot) to the single-site 
expression [B] = Bmax x [L]/([L] + Kd). Linear Scatchard plots are shown for 
30 illustration. Data on K* from competition experiments were analyzed by nonlinear 
regression to the expression [B] = [B 0 ]/(l + P]/Ki 0bs ), and where required were 
corrected for radioligand competition as K» = Kj obs /(l + [L*]/Kd). 
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First-order rate constants (kobs and kos) were determined by nonlinear 
regression to the first order rate equation A - Aoe~ to . Kinetic data for ko n were 
analyzed according to Weiland and MolinofF (32), using the equation kon = kobs 
([LR]e/([L] [LR]max))» where [L] is the concentration of ligand, [LR] e is the 
5 concentration of the complex at equilibrium, [LRJmax is the maximum number of 
receptors present, and kobs is the apparent first-order rate constant 

Binding analysis. Binding studies using the [ 3 H]ezetimibe glucuronide 
a traditional rapid-filtration assay on glass fiber filters using enterocyte brush border 
membrane preparations from rat, mouse and rhesus monkey were performed (Table 

10 11). Table 1 1 shows the binding affinities of [ 3 H]ezetimibe glucuronide to the 
membranes in the absence of detergents. The observed binding was specific, 
saturable, and consistent with a single molecular site. Scatchard analyses and the 
specific/nonspecific binding windows for rat and monkey are shown in Figure 12. 
The binding affinity is relatively weak in rat membranes (Kd = 542 nM) and even 

1 5 weaker in murine membranes (Kd = 1 0,000 nM). In contrast, binding affinity in 
rhesus monkey membranes is approximately 1 0-fold greater (Kd = 41 nM). The 
number of binding sites varied from 5 — 20 pmol/mg membrane protein depending on 
species and preparation. 

The rates for binding and dissociation of [ 3 H]ezetimibe glucuronide 

20 were determined and found to be slow relative to those typically observed for protein- 
ligand interactions. For example, the rate constants for association to rat and monkey 
brush border membranes are kon= 5.54 and 3.90 x 10 3 M" ! s" 1 (Figure 12). These are 
100,000-fold smaller than those typically observed for a diffusion controlled 
encounter, 10 8 to 10 9 M" 1 s" 1 . Similarly, these complexes are unusually long-lived, 

25 dissociating with rate constants of koff= 2.4 x 10~ 3 s" 1 and 1 .2 x 10* 4 s' 1 at 25C, 
equivalent to half-lives of 4.7 and 96 min for the rat and monkey complexes, 
respectively. In comparison, half lives are normally <1 sec for dissociation of 
common diffusion controlled, 1 00 nanopolar K D ligands. These rate constants predict 
Kd values (K D = koff/kon) of 440 and 32 nM,' respectively, which agree well with those 

30 measured by equilibrium titration (Figure 12), and by saturation as described earlier. 
Such slow-forming, long-lived complexes suggest that conformational changes in the 
protein are rate limiting. 
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Table 11: Comparison of ezetimibe binding affinity and cross species efficacy 

Species I K D (nM) I ED 50 (mg/Kg) 

mouse 12,000 0.5000 

"rat 542 0.0300 

monkey 41 0.0005 " —— ~ 



Table 1 1 also shows a correlation between in vitro and in vivo binding 
of [ 3 H]ezetimibe glucuronide in various enterocyte brush border membrane 
5 preparations from rat, mouse and rhesus monkey. The in vivo ED 50 values are derived 
from cholesterol absorption and cholesterol feeding studies. The rank order of 
ezetimibe potency (ED50) in vivo as follows: rhesus (0.0005mpk) > rat (0.03mpk) 
> mouse (0.5mpk) is the same as the order of in vitro binding affinity (IC50) as 
follows: rhesus monkey (41nM) < rat (542nM) < mouse (12,00OnM). 

1 0 The binding affinities of 1 to brush border membranes correlate well 

across species with the sensitivity to ezetimibe inhibition of cholesterol uptake in vivo 
(mouse < rat < monkey) (Clader, J. W. The discovery of ezetimibe; A view from 
outside the receptor. J Med, Chem. 47, 1-9 (2004); Davis, H.R. Jr., Compton, D.S., 
Hoos, L. & Tetzloff, G. Ezetimibe, a potent cholesterol absorption inhibitor, inhibits 

15 the development of atherosclerosis in ApoE knockout mice. Arterioscler. Thromb. 
Vase. Biol 21, 2032-2038 (2001); Burnett, DA Beta-lactam cholesterol absorption 
inhibitors. Curr. Med. Chem. 11, 1873-1887 (2004), consistent with the hypothesis 
that the assay is relevant to the target of ezetimibe in vivo (Table 11). As evidence 
that this interaction is very specific, the glucuronide of the enantiomer of ezetimibe 

20 was prepared and found to be completely inactive in vitro (Ki > 100 x Kp for 

ezetimibe glucuronide in all species), consistent with its lack of activity in vivo in a rat 
acute cholesterol absorption model (see Table 12 in which the enantiomer is 
analyzed). 

Example 34: NPC1L1 as the target of ezetimibe in NPClLl-expressing HEK293 
25 cells. 

This example demonstrates that ezetimibe binds specifically to 
NPC1 LI -expressing HEK293 cells. 

Transient expression of NPC1L1. Plasmid pCR3.1 expressing rat 
NPC1L1 (GenbankAY437867) or human NPC1L1 (Genbank AY437865) were 
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prepared using standard molecular biology protocols. HEK-293 cells (ATCC) were 
seeded at 10 x 10 6 cells per T-225 flask (Coming) in DMEM containing 10% fetal 
calf serum, 4.5 g/L D-glucose and L-glutamine, 18 hours prior to transfection. They 
were transiently transfected with 25 |ig of DNA using Fugene transfection reagent 
5 (Roche Biochemical) at a ratio of 6: 1 Fugene:DN A. Following transfection, the cells 
were incubated at 37°C and 5% CO2 for 48 hours, and then harvested using PBS 
based cell dissociation buffer (Gibco), pelleted at 500 x g, snap frozen on dry ice, and 

stored at -80°C. v 

Membrane preparation from HEK-293 cells. Membranes were 

1 0 prepared by resuspending the frozen cell pellets in ten volumes of 20 mM 

HEPES/Tris buffer at pH 7.40 containing 8% sucrose, and sonicating the suspensions 
with a probe sonicator on ice until most of the cells were lysed To isolate the 
membranes, the sonicates were centrifuged at 1600 x g for 10 min to remove cell 
debris, and then the supernatants were centrifuged at 125,000 x g for 1 hour to recover 

15 the membranes. These membranes were resuspended in 20 mM HEPES/Tris buffer at 
pH 7.40 containing 160 mM NaCl and 5% glycerol, and stored at 10-20 mg/ml 
protein at -80°C. 

Pursuing the recent evidence indicating that NPC1L1 is an important 
component of the pathway inhibited by ezetimibe, recombinant rat and human 

20 NPC1L1 were expressed in human embryonic kidney (HEK) 293 cells (Figure 13, 
Panel 1). Cell iysates from HEK-293 cells expressing NPCl LI (Lanes 1 and 3 of 
Panel 1 Figure 13) and wild-type cells (Lanes 2 and 4 of Panel 1 Figure 13) were 
analyzed by gel electrophoresis and Western blot with an anti-NPClLl antibody 
Al 801. An excess of NPC1L1 -specific peptide was included to assess specificity of 

25 the antibody for NPC1L1 (Lane 3 and 4 of Panel 1 Figure 13). Preliminary binding 
studies using 1 revealed specific binding to membrane preparations from cells 
expressing NPC1 LI > and no specific binding to membranes from mock transfected 

cells (not shown). 

Binding to NPC1L1 expressing cells was also observed with a 
3 O BODIPY-labeled fluorescent ezetimibe glucuronide analog (SCH354909) (Figure 13, 
Panel 2A). Panel 2 of Figure 13 shows confocal microscope images of a fluorescent 
ezetimibe glucuronide analog (SCH354909) bound to the surface of NPC1L1-293 
cells (Panel 2A), nonspecific binding of SCH354909 to NPCltl-293 cells in the 
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presence of 100 jjM unlabeled ezetimibe glucuronide (Panel 2B), binding of 
SCH354909 to wild type HEK 293 cells (Panel 2C), and nonspecific binding of 
SCH354909 to wild type HEK 293 cells in the presence of 100 nM unlabeled 
ezetimibe glucuronide (Panel 2D). In each case, plated cells were incubated in culture 
5 media with 500 nM SCH354909 for 4 hours at 37°C. Cells were subsequently 
washed with PBS and fluorescence was detected using confocal microscopy. 

Binding of SCH345909 was clearly evident at the cell surface 
membrane of the NPC1 LI -expressing cells and was completely abolished in the 
presence of excess unlabeled ezetimibe glucuronide (Figure 13, Panel 2C). No 
10 binding was observed in wild type HEK 293 cells (Figure 13, Panels 2B and 2D). 

These results demonstrated that ezetimibe glucuronide binds specifically to NPC1L1 . 

Example 35: NPC1L1 as the in vivo target of ezetimibe. 

To obtain evidence that NPC1L1 is the direct binding target of 
ezetimibe in vivo, binding affinities of 1 and several key analogs were determined for 

1 5 recombinant rat and human NPC1L1 expressed in HEK-293 cell membranes and 
compared to those for native rat and rhesus intestinal enterocyte brush border 
membranes. A series of ezetimibe analogs was selected with subtle structural 
diversity, but with no binding affinities to native brush border membranes that 
covered a range of 1 000-fold. 

20 Table 12 shows a comparison of binding affinities (Ki values) for 

recombinant NPC1L1-293 cell membranes and native brash border membranes 
Selected analogs of ezetimibe glucuronide are compared against recombinant rat and 
human NPC1L1 membranes prepared from transiently transfected HEK-293 cells 
compared to native rat and rhesus brush border membranes. The binding assays were 

25 conducted in a final volume of 20 jjlI in the presence of 0.03% sodium taurocholate 
and 0.05% digitonin until equilibrium was achieved. 1 .25 mg protein/ml and 1 00 nM 
1 were used for native rat, recombinant rat, and recombinant human experiments, and 
1 .25 mg protein/ml and 20 nM 1 were used for native rhesus monkey experiments. 
Observed total and nonspecific binding, respectively, in the absence of inhibition 

30 were native rat: 7,700 & 1 ,100, recombinant rat: 33,000 & 1,100, native rhesus 
monkey: 7,300 & 367, and recombinant human: 19,200 8c 1,000 dpm. Analog 
structures are defined in Table 12. Compound 4 has the stereochemical configuration 
3S, 4R, and is the glucuronide of the enantiomer of ezetimibe. These determinations 

t 
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were conducted in buffer containing 0.03% taurocholate and 0.05% digitonin, levels 
below the critical micelle concentrations of these detergents. These conditions 
enhanced apparent binding by as much as 20-fold for the recombinant preparations 
(principally a Bmsx effect), and greatly facilitated a quantitative comparison of K* 

5 values for 1 and its analogs. 

As shown in Table 12, the Ki values for recombinant rat NPC1L1 and 
native rat brush border membranes are virtually identical, strongly suggesting that 
NPC1 LI is the molecular target of ezetimibe in vivo. In the case of membranes from 
cells expressing recombinant human NPCIL1, the binding affinities also parallel 

10 those observed in rat membranes, whereas binding affinities for native rhesus brush 
border membranes are uniformly -1 0-fold more potent. This result is consistent with 
the finding that ezetimibe is an order of magnitude more potent in monkey than in 
human or rat (Clader, J. W. The discovery of ezetimibe; A view from outside the 
receptor. J Med. Chem. 47, 1-9 (2004); Jeu, L. & Cheng, J.W. Pharmacology and 

15 therapeutics of ezetimibe (SCH 58235), a cholesterol-absorption inhibitor. Clin. Titer. 
25,2352-2387 (2003)). 
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TABLE 12 

OH 




5 



Analog 


X 


Y 


Rat 


Rat 


Human 


Monkey 








BBMKi 
(nM) 


NPC1L1 

Ki 

(nM) 


NPC1L1 
Ki 

(nM) 


BBMKi 
(nM) 


KZR-gtuc 
1 


OH (5) 


H 


390 


210 


220 


15 


4* 


ohor) 


H 


130,000 


74,000 


130,000 


22,000 


3 


H 


H 


1,600 


820 


1,000 


150 


6 


=o 


H 


33,000 


23,000 


14,000 


3,300 


5 


OH(*) 


H 


2,800 


1,700 


1,300 


120 


2 




OH 


280 


360 


210 


60 



glucuronide of the enantiomer of ezetimibe has stereochemical configuration 3S, AR 
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Conclusive evidence that NPC1L1 is the target of ezetimibe was 
provided by studies with tissues from NPC1 LI deficient mice. Enterocyte brush 
border membranes prepared from NPC1L1 deficient mice showed no detectable 
specific binding affinity for 1, whereas membranes from age-matched wild-type mice 
5 showed a high level of specific binding with a K D = 1 2 \sM (Figure 1 4). 

For Figure 14 A, enterocyte brush border membranes were prepare*! 
from NPC1L1 deficient male mice and same sex wild-type littermates, and tested for 
binding of 1. Conditions for binding were 5 mg/ml protein and 500 nM 1 in a volume 
of 20 yl and in the presence of 0.03% sodium taurocholate and 0.05% digitonin. 

10 Membranes from wild type mice are on the left and from NPC1L1 deficient mice on 
the right. The bar graphs indicate total binding (left bar), nonspecific binding in the 
presence of 500 |iM cold ezetimibe glucuronide (middle bar), and specific (right bar) 
binding, respectively for each of wild type and NPC1L1 deficient mice, and error bars 
represent triplicate measurements. The graphs show that although specific binding is 

1 5 readily detectable in wild-type mice, it is absent in NPC 1 Ll-deficient mice. 

Figure 14B shows a plot demonstrating competition of unlabeled 
ezetimibe glucuronide against X- Membranes from wild-type mice (upper curve) gave 
IQ = 12,000 nM, while specific binding was virtually undetectable in membranes from 
the knockout animals (lower curve). Conditions were those described in Figure 14A. 

20 The present studies involve a quantitative comparison of binding 

between recombinant proteins and brush border membranes. SR-BI (scavenger 
receptor type Bl) was previously identified as a potential target using an expression 
cloning strategy employing ezetimibe binding to candidate proteins; this hypothesis 
was readily dismissed when neither cholesterol absorption nor ezetimibe activity were 

25 affected in SR-BI deficient mice (Altmann, S. W. et al The identification of intestinal 
scavenger receptor class B, type 1 (SR-BI) by expression cloning and its role in 
cholesterol absorption. Biochem. Biophs. Acta 1580, 77-93 (2002)). The results show 
that ezetimibe binds to native intestinal membranes and cells expressing recombinant 
NPC1L1 with comparable affinity, and does not bind to membranes from NPC1L1 

30 deficient mice, indicating a specific binding interaction between NPC 1 LI and 
ezetimibe. Together with the previously published findings that mice deficient in 
NPC1L1 are defective in intestinal cholesterol uptake, and are no longer responsive to 
ezetimibe (Altmann, S. W. et al Niemann-Pick CI Like 1 protein is critical for 
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intestinal cholesterol absorption. Science 303, 1201-1204 (2004)), these data 
definitively establish NPC I LI as the direct target of ezetimibe. 

Example 36: Effect of detergents on [ 3 H]ezetimibe glucuronide binding. 

A practical aspect of work with the recombinant protein was that the 
5 number of binding sites in transfected NPC 1 LI -293 cell membranes initially appeared 
quite low. The influence of a combination of 0.03% taurocholate and 0.05% digitonin 
on specific binding to these and native enterocytc brush border membrane 
preparations is dramatic as illustrated in Figure 15. 

Equal amounts (25 jig protein) of rat brush border membranes, 

10 membranes from HEK-293 cells transiently expressing recombinant rat and human 
NPC 1 LI, were incubated with 25 nM 1 in a final volume of 20 jjI until equilibrium 
was achieved. The incubation conditions were buffer A with and without sodium 
taurocholate and digitonin to a final concentration of 0.03% and 0.05%, respectively. 
On the x-axis, "C" denotes controls in the absence of detergent, and "+det" the 

1 5 response in the presence of both detergents. The results are shown in 3 bar groupings; 
Total binding (left bar in each 3 bar group), nonspecific binding in the presence of 
100 unlabeled ezetimibe glucuronide (middle bar in each 3 bar group), and 
specific binding (right bar in each 3 bar group) are shown. 

Example 37: Binding affinities of ezetimibe glucuronide and various analogues 
20 NPC1L1 in rat and rhesus monkey membranes. 

As determined from binding assay results using 3 H-ezetimibe 
glucuronide with rat brush, border membrane, representative tested compounds of 
Formula II were determined to have ICso's of about 13,000nM or lower, and 
particularly certain tested compounds had ICso's of about 1900nM or lower, more 

25 particularly certain tested compounds had ICso's of about lOOOnM or lower, and 
most particularly certain tested compounds had ICso's of less than lOOnM. As 
determined from binding assay results using 3 H-ezetimibe glucuronide with rhesus 
brush border membrane, representative tested compounds of Formula II were 
determined to have ICso's of about 4200nM or lower, and particularly certain tested 

30 compounds had ICso's of about 1 65nM or lower, more particularly certain tested 
compounds had ICso's of less than lOOnM, and most particularly certain tested 
compounds had ICso's of less than 50nM. 
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The designations below are used in the Examples that follow for 
certain repetitively used intermediates: 





The compounds (3i?,4^-3-[(3^3-(4«fluorophenyl)-3-hydroxypropyl]-4-(4- 
10 hydroxyphenyl)-l-(4-iodophenyl)azetidin-2-one and 4-[(2aS,32*)-3-[(3S)-3-(4- 
fluorophenyl)-3 -hydroxypropyl]- 1 -(4-iodophenyl)-4-oxoazetidin-2-yl]phenyi methyl 
p-D-glucopyranosiduronate (i£Z) were prepared according to Burnett, D. S.; Caplen, 
M. A.; Domalski, M S.; Browne, M. E.; Davis, H. R. Jr.; Clader, J. W. Bioorg. Med. 
Chem. Lett. (2002), 72, 3 1 1 . Compound izE is the hydroxy-protected analog of 1=7, 
15 . where the protecting group is acyl. 
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The following definitions are also used in the Examples that follow: 

OAc 




EXAMPLE 38 

5 Preparation of iV-prop-2-vn-l-vlacet amide (i-1) 

Acetyl chloride (0.52 mL, 7.3 mmol) was added to a stiired solution of 
propargylamine (0.5 mL, 7.3 mmol) and dimethylaminopyridine (18 mg, 0*14 mmol) 
in pyridine (2.5 mL) at 0°C, and the resulting mixture was allowed to warm to 
ambient temperature. After approximately 1 5 h, the reaction mixture was diluted with 
10 ethyl acetate and washed successively with IN HC1 and brine. The organic phase was 
dried (Na2SO,|), filtered and concentrated in vacuo to afford the title compound (i-1), 
which was used without further purification. 

EXAMPLE 39 

Preparation of jV-prop-2-vn- 1 - vlbenzeriesulfonamide (i-2^ 
15 Benzene sulfonyl chloride (1.16 mL, 9. 1 mmol) was added to stirred 

solution of propargylamine (0.62 mL, 9.1 mmol) and dimethylaminopyridine (22 mg, 
0.18 mmol) in pyridine (5 mL) at room temperature. The resulting solution was aged 
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at ambient temperature for approximately 15 h. The reaction mixture was diluted 
with ethyl acetate and washed successively with IN HC1 and brine. The organic 
phase was dried (Na2SC>4), filtered and concentrated in vacuo to furnish the title 
compound (i-2), which was used without further purification. 

5 EXAMPLE 40 

Preparation of A^JST-Dimethyl-AT -prop-2-vn- 1 -vlurea (i-3^ 

Dimethyl carbamylchloride (0.84 mL, 9.1 mmol) was added to a 
stirred solution of propargylamine (0.62 mL, 9.1 mmol) and dimethyiaminopyridine 
(22 mg, 0. 1 8 mmol) in pyridine (5 mL) at room temperature. The resulting 
10 suspension was stirred at ambient temperature for approximately 15 h. The reaction . 
mixture was diluted with ethyl acetate and washed successively with IN HC1 and 
brine. The organic phase was dried (N^SO^), filtered and concentrated in vacuo to 
afford the title compound (i-3), which was used without further purification. 

EXAMPLE 41 

15 Preparation of A^-Methvl-A^prop-2- vn-1 -vliriethanesxilforflrnifle fi-4) 

Methansulfonylchloride (1.12 mL, 14.5 mmol) was added to a stirred 

solution of N-methylpropargylamine (1.22 mL, 14.5 mmol) and 

dimethyiaminopyridine (35 mg, 0.30 mmol) in pyridine (10 mL) at room temperature. 

After aging for approximately 15 h, the reaction mixture was poured into ethyl acetate 
20 and washed successively with IN HC1 and brine. The organic phase was dried 

(Na2S(>4), filtered and concentrated in vacuo, to afford the title compound (i-4), which 

was used without further purification. 

EXAMPLE 42 

Preparation of Mprop-2-vn-l-vlmethanesul fbnaTriide 

25 Methansulfonylchloride (1 .40 mL, 18.1 mmol) was added dropwise to 

a stirred solution of propargylamine (1.00 g, 18.1 mmol) and dimethyiaminopyridine 
(44.0 mg, 0.36 mmol) in pyridine (10 mL) at 0 °C. After aging for approximately 
15 h, the reaction mixture was poured into IN HC1 and extracted twice with ethyl 
acetate. The combined organic extracts were washed with saturated aqueous sodium 

30 bicarbonate, brine, dried (MgSCX*), filtered and concentrated in vacuo, to afford the 
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title compound i-5. Crude i-5 crystallized on standing and was used without further 
purification. 

EXAMPLE 43 

Preparation of7Sr-(3-(4-[(2S37gV3-ra 
5 hvdroxvphenyn-4oxoazetidin-l ■vlTphenvl}prt)p-2-vn-l -vnmethanesulfonamide 
(Compound 6al 




Triethylamine (? equivalents) is added to a solution of (3R,4S)-3-[(3S)- 
3-(4-fluorophenyl)-3-hydroxypropyl]-^ 

10 2-one (i-6) (1.00 equivalent), ^-prop-2-yn-l-ylmethanesulfonamide (1.50 

equivalents), tetrakistriphenylphosphine palladium(0) (0.15 equivalents) and copper(I) 
iodide (0.30 equivalents) in DMF (0. 1 M concentration with respect to final product) 
under a nitrogen atmosphere and the resulting solution aged at room temperature. 
After completion of reaction, the volatiles are evaporated in vacuo and the crude 

1 5 residue can be purified by flash chromatography on silica gel to afford the title 
compound. 
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EXAMPLE 44 



Step A: 



Preparation of 4-[f25'.3i?V 3-fr3^3>f4-fluoroDhen^V3> 
h vdrox vpropvl]- 1 -(4- {3 - [(methvlsvdfonvD arnino]prop- 1 -vn- 1 ■ 
yl}phenvfl-4-oxoazetidin-2-vHprienvl methvi B-D- 
gluco pyranosiduronate (Compound 7al 




Triethylamine (0.07 mL, 0.502 mmol) was added to a stirred solution 
of4-[(2S,3i0-3-[(3iS)-3-(4-fluoroph^ 

oxoazetidin-2-yl]phenyl methyl {3-D-gtucopyranosiduronate (i£Z) (0.050 g, 0.071 
1 0 mmol), jNT-prop-2-yn-l -ylmethanesulfonamide (i-5) (0.01 4 g, 0. 1 05 mmol), 

tetrakistriphenylphosphine palladium(0) (0.012 g, 0.010 mmol) and copper iodide 
(0.005 g, 0.026 mmol) in DMF (0.5 mL) under a nitrogen atmosphere and the 
resulting solution aged at room temperature for 18 h. The volatdles were evaporated 
in vacuo and the crude residue purified by flash chromatography on silica gel 
1 5 (gradient elution; 0-25% methanol/methylene chloride as eluent) to afford the title 
compound; m/z (ES) 713 (MH 4 ), 505. 
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Step B: 



Pre paration of 44^3^-3-rn ^>f4-fluorot)henvlV3> 
hvdroxvDropv11-l -r4- O^^ fithylsulfonvnaminolproT^l -vn-1 - 
vl}phenvn-4-Qxoazetidin^ -^lp^envl B-D-rfucopvranosiduronic acid 
fCompound 7b. also referred to he rein as compound 2^ 




A solution of compound 7a in methanol/water/triethylamine (1 :7:2; 
1 mL) was stirred at room temperature for approximately 1.5 h. The volatiles were 
evaporated in vacuo and the crude residue purified by preparative reversed phase high 
performance liquid chromatography on YMC Pack Pro CI 8 phase (gradient elution; 
1 0 1 0-65% acetonitrile/water as eluent, 0. 1 % TFA modifier) to give the title compound 
(7b); mlz (ES) 699 (MH 4 ), 505; HRMS (ES) mlz calcd for C34H36FN2O11S (MH*) 
699.2024, found 699.2016. 

EXAMPLE 45 (COMPOUNDS 6B TO 6G AND 7C TO 7N) 

The following compounds of Formula Ha have been prepared (as 
1 5 indicated by MS data provided) or can be prepared using the general synthetic 

procedures described in Example 43 (shown in Table 13) or Example 44 (shown in 
Table 14). 
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TABLE 13 



Compound 


R12 


R9 


6b> 


H 




6c) 


H 


>V H 

^»00« fvl - 

o 


66) 


H 


Me 

o 2 


6e) 


H 




6f) 


H 


O 


6g) 


H 


Me 

Me 
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TABLE 14 



Compound 


R12 


R9 


w/z(ES) 


HRJMS 
»i/zCES) 


7c) 


methyl ester 
glucuronide 


^\^NH 2 


658 (MNa 4 ) 




7d) 


glucuronide 


y^. 


621 (MH 4 ) 


(MKT 4 ) 
Calcd 621.2249 
Found 621.2223 


/ ©J 


xncuiyx wo lci 
glucuronide 


o 


677 (MH 4 ) 




7f\ 
'V 


giucuronioe 


H 

o 


663 (MH 4 ) 


Calcd 663.2354 
Found 663.2331 




tneuiyi esier 
glucuronide 


o 2 


749 (MNa 4 ) 




7h) 


glucuronide 


y Me 
^\ ^N. „Me 


735 (MNa*) 


our 4 ) 

Calcd 713.2180 
Found 713.2170 


7i) 


methyl ester 
glucuronide 


y^ H 

o 2 


797 (MNa*) 




7j) 


glucuronide 


o 2 


783 (MNa 4 ) 


(MH 4 ") 
Calcd 761 -2180 
Found 761 .2193 


7k) 


methyl ester 
glucuronide 


*v£ , . Me 

|f Me 
O 


706 (MH 4 ) 
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TABLE 14 



Compound 


R12 


R9 


m/z(ES) 


HRMS 
mlz (ES) 


71) 


ghicuronide 


H J. 
Y Me 
O 


692 (MH*) 


(MH*) 
Calcd 692.2620 
Found 692.2618 


7m) 


methyl ester 
glucuronide 


Me 
Me 


663 (MH 4 ) 




7n) 


ghicuronide 


^4 Me 


649 (MH+) 


(MH*) 
Calcd 649.2562 
Found 649.2532 



EXAMPLE 46 

Step A: Preparation of 4-((2£3 jgV3-[( 3SV3-( 4-fluorophenvlV3- 

hvdroxypropvll-4-oxo-l - {4-f (trimethvlsilvnethynvl}phenvl ) azetidin- 
5 2-vfiphenyl methvl P-D-glucopvranosiduronate (8a) 

OH 




Triethylamine (69.0 jiL, 0.495 mmol) was added to a stirred solution 
of i^Z (50.0 mg, 0.071 mmol), trimethylsilyacetylene (12.0 jiL, 0.085 mmol), 
tetrakistriphenylphosphine palladium(0) (13.0 mg, 0.01 1 mmol) and copper iodide 
10 (5.10 mg, 0.028 mmol) in DMF (0.5 mL) under a nitrogen atmosphere and the 

resulting solution aged at room temperature for 18 h. The volatiles were evaporated 
in vacuo and the crude residue purified by flash chromatography on silica gel 
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(gradient elution; 0-25% methanol/methylene chloride as eluent) to afford the title 
compound (8a); mlz (ES) 660 (M-OH) + , 470. 

Step B: Preparation of 4- (( 2S3R\-\ -f 4-ftthynvlphenvlV3--r( 35V3-f 4- 

fluoro phenvl V3 >hvdroxvpmpv1]>4-oxoazetidin-2~vl } phenyl B-D- 
5 glucopvranosiduronic acid (8b) 



• 




A solution of 8a in methanol/water/triethylamine 
(0.25 mL: 1.10 mL:0.40 mL) was stirred at room temperature for approximately 6 h. 
The volatiles were evaporated in vacuo and the crude residue purified by preparative 
1 0 reversed phase high performance liquid chromatography on YMC Pack Pro CI 8 

phase (gradient elution; 10-65% acetonitrile/water as eluent, 0.1M> TFA modifier) to 
give the title compound (8b); mlz (ES) 574 (M-OH) + , 398; HRMS (ES) mlz calc'd for 
C32H 3 iFN09 (MH*) 592.1983, found 592.1985. 

EXAMPLE 47 

15 Step A: Pre paration of 4^rr2>S , .3/?V3-rf3»S , >-3-(4-fluorophea-vlV3- 

hydro* ypropvi yi -(4- f 3-f fmethvlsulfonvnainino1propvnphenYl)-4- 
oxoazetidin-2-vnphenvl methyl B-D-glucopvranosiduronate ( 9a) 

HQ 
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A mixture of 2a (40.0 mg, 0.056 mmol) and palladium (-8 mg of 10 
wt. % (dry basis) on activated carbon) in methanol (2 mL) was hydrogenated at 
atmospheric pressure for approximately 1 h. The reaction mixture was filtered 
througji a short plug of celite, eluting copiously with methanol, and the filtrate 
evaporated in vacuo to afford the title compound (2§); mlz (ES) 509 (M-sugar-OH) + . 
StepB: Preparation of 44r25 f .3/gV3^r3 l SV3^4~fluorophenvlV3>- 

hvdroxvpropvl]-l'f4-{3-rfmethvlsulfonvnamino1propvl>phenvlV4- 
oxoazetidin~2-vllphenvl B-D-riucopvranosiduronic acid f 9b) 




10 A solution of 9a in methanol/water/triethylamine (1 :7:2, 1 mL) was 

stirred at room temperature for approximately 1 h. The volatiles were evaporated in 
vacuo and the crude residue purified by preparative reversed phase high performance 
liquid chromatography on YMC Pack Pro CI 8 phase (gradient elution; 10-65% 
acetonitrile/water as eluent, 0.1% TFA modifier) to give the title compound (gb); mlz 

15 (ES) 735 (M+Na) + , 685 (M-OH) + , 509 (M-sugar-OH) + ; HRMS (ES) m/z calc'd for 
C34H39FN2O11S (MH*) 703.2337, found 703.2337. 

EXAMPLE 48 

Step A: Preparation of 4-{f2^3^V3>[(35V3-acetoxvV3-f4- 

fluorophenvDpropvl]- 1 -\ 4-( 3 - (\tert-bvfcvl( dimethvlsil vl] oxv} prop-1 - 
20 vn-l"Vl>phenvn-4-oxoa2etidin-2>vnphenvl methvl 2.3.4-tri-6>-acetvl- 

p-D-glucopvranosiduronate flOa^ 




OTBS 



Triethylamine (170 |aL, 1 .25 mmol) was added to a solution of M (1 56 
mg, 0.178 mmol), ^-butyldimethyl(2-propynyloxy)silane (43.0 fiL, 0.214 mmol), 
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dichlorobistriphenylphosphine palladium(II) (12.0 mg, 0.018 mmol) and copper 
iodide (7.00 mg, 0.036 mmol) in DMF (1.3 mL) under a nitrogen atmosphere and the 
resulting solution aged at room temperature for approximately 20 h. The reaction 
mixture was poured into saturated aqueous sodium bicarbonate and extracted twice 
5 with diethyl ether. The combined organic extracts were washed with water, brine, 
dried (MgSO/O, filtered and the filtrate concentrated in vacuo. Purification of the 
crude residue by flash chromatography on silica gel (gradient elution; 15-40% ethyl 
acetate/hexanes as eluent) afforded the title compound 10a. 
Step B: Preparation of 4- {(2^3i?V3-r(3^-3-(acetYloxYV 3- 

10 (4-fluoyophenyl)propyi;b 1 -QH3-hydroxyprop- \ -yn-1 - 

vlfyhenvl]-4-oxoazetidin-2-vl)phenvl methvl 
2,3,4-tri-O-acetvl-B-D-glucopvranosiduronate ( 10b) 




OH 



Tetrabutyiammonium fluoride hydrate (39.0 mg, 0.148 mmol) was 
1 5 added to 10a (136 mg, 0. 148 mmol) in tetrahydrofuran (1 .5mL), and the resulting 
solution aged at room temperature for 1 h. The reaction mixture was poured into 
saturated aqueous ammonium chloride and extracted twice with ether. The combined 
organic extracts were washed with saturated sodium bicarbonate, brine, dried 
(MgS04), filtered and concentrated in vacuo. Purification of the crude residue by 
20 flash chromatography on silica gel (50% ethyl acetate/hexanes) afforded the title 
compound 10b . 

Step C: Preparation of 4- f (2S3 jgV3>r(3^3^acetvloxvV 

3>f4-fluorophenvl>oropvll-4-oxo-l-f4>(3-oxoprop- 
l-vn-l-vlfohenvl]azetidin-2-vl}phenvl methvl 
25 2.3.4-tri-O-acetvl-p— D-glucopvranosiduronate (1 Pel 




10c ^^CHO 
Dess-Martin periodinane (33.0 mg, 0.077mmol) was added to a 
solution of 10b (62.0 mg, 0.077 mmol) and pyridine (31.0 jiL, 0.386 mmol) in 
dichloromethane (1 mL) at room temperature. After 30 min, the reaction mixture was 
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poured into saturated aqueous sodium bicaibonate, and extracted twice with ethyl 
acetate. The combined organic extracts were washed with water, brine, dried 
(MgS04>, filtered and concentrated in vacuo. Purification of the crude residue by 
flash chromatography on silica gel (gradient elution; 20-40% ethyl acetate/hexanes) 
afforded the title compound 10c . 

Step D: Preparation of 4- {( 2£3flV3-|Y3,SV3-( acetvloxvV3-(4- 

fluorophenvllpropvl1-l-[4^caiboxvethvnvl^henvl1-4^xoazetidin-»2- 
vllphenvl methyl 2.3.4- tri-O-acetvl-B-D-glucopvranosiduronate nod") 




^^C0 2 H 

10 An aqueous solution (0.1 mL) of sodium dihydrogenphosphate (9.00 

mg, 0.065 mmol) and sodium chlorite (5.00 mg, 0.055 mmol) was added to a solution 
of 10c (37.0 mg, 0.046 mmol) in tert-butyl alcohol (0.4 mL), dioxane (0.2 mL) and 
isobutylene (-0.1 mL) at room temperature. After 1.5 h, the reaction mixture was 
concentrated in vacuo and the crude residue triturated repeatedly with ethyl acetate. 

15 The organic washings were dried (Na2SC>4), filtered and concentrated in vacuo to 
afford the title compound lOd . 

Step E: Preparation of 4W(2£3flVl-[4-(carboxvethvnv^^ 

3-(4-fluorophenyl>3-frydire^ 
ft-D- glucop vrano siduronic acid ( IQe) 




20 10e ^CO z H 

A solution of 4-{(2S',3^)-3-[(3 l S)-3-(acetyloxy)-3-(4- 
fluorophenyl)propyl]-l-[4-(carboxyethynyl)phenyl]^oxoazetidin-2-yl}phenyl 
methyl 2,3,4-tri-O-acetyl-p-D-glucopyranosiduronate (lOd) and sodium cyanide 
(~lmg, 0.020 mmol) in methanol (3mL) was heated to 45°C. After 22 h, the reaction 
25 mixture was concentrated under reduced pressure and dissolved in 

methanol/water/triethylamine (1 :7:2, 1 mL). After stirring at room temperature for 
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approximately 1 h, the volatiles were evaporated in vacuo and the crude residue 
purified by preparative reversed phase high performance liquid chromatography on 
YMC Pack Pro CI 8 phase (gradient elution; 10-60% acetonitrile/water as eluent, 
0.1% TFA modifier) to give the title compound Q0e), mlz (ES) 442.0 (M-sugar- 
5 OH) + , 618.0 (M-OH) + ; HRMS (ES) m/z calcd. for C33H31FNO11 (MH*) 636.1881, 
found 636.1889 

EXAMPLE 49 

Step A: Preparation of 4M2S3R)-3-\(3S)-3Ja<xtv loxvV3-( 4- 

fluorophenvl^ropyl1-l-{4^3-fethvlaminoV3-oxoproi>- 
10 1 -vn-1 ~yl' | phenvl}-4-oxoazetidin-2-vDphenvl methyl 

2.3.4-tri-O-acetvl-B-D-glucopvranosiduronate (1 la) 




AIM solution of ethylamine hydrochloride and diisopropylethylamine 
in DMF (40.0 \iL, 0.40 mmol) was added to 4-{(25,3/2)-3-[(3iS)-3-(acetyloxy)-3-(4- 

1 5 fluorophenyl)propyl]-l-[4-(carboxyethynyl)phenyl]-4-oxoazetidin-2-yl}pheny 

methyl 2,3,4-tri-O-acetyl-p-D-glucopyranosiduronate (lOd) (27.0 mg, 0.033 mmol), 
l-[3-(dimethylamino)propyl]-3^thylcarbodiimide hydrochloride (19.0 mg, 0.099 
mmol) and l-hydroxybenzotriazole (8.00 mg, 0.059 mmol) in DMF (0.25 mL). After 
4.5 h, the reaction mixture was poured into ethyl acetate and washed successively 

20 with water and brine. The organic layer was dried, filtered and concentrated under 
reduced pressure. Purification of the crude residue by flash chromatography on silica 
gel (gradient elution; 50-60% ethyl acetate/hexanes) afforded the title compound Ha. 
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Step B: Pre P ar^onof4-{f^-3i?Vl-{ A -r^-f pt H^ innV ^- nY "P"- t P- 1 -Y"- 

l-vl 1 P henvn-3-r^^-3-f4-f 1ii"rn P henv1V3-hvdroxvDropvn-4- 

oxoazetidip -^-V ^phenvl p-O-glTicop yranosiduronic acid (UV\ 



OH 




o 



5 A solution of 4-((2^,3ie)-3-[(35}-3-(acetyloxy)-3-(4- 

fluorophenyl)propylJ-l - {4-(3 -(ethylamino)-3-oxoprop- 1 -yn- 1 -yl]phenyl} -4- 
oxoazetidin-2-yl)phenyl methyl 2,3,4-tri-O-acetyl-p-D-glucopyranosiduronate C2Ja) 
(22.0 mg, 0.026 mmol) and sodium cyanide (~lmg, 0.020 mmol) in methanol (3mL) 
was heated to 45°C. After 18 h, the reaction mixture was concentrated under reduced 

10 pressure and dissolved in methanol/water/triethylamine (1:3:1, 2.5 mL). After stirring 
at room temperature for approximately 1 h, the volatiles were evaporated in vacuo, 
and the crude residue purified by preparative reversed phase high performance liquid 
chromatography on YMC Pack Pro CI 8 phase (gradient elution; 10-60% 
acetonitrile/water as eluent, 0.1% TFA modifier) to give the title compound Gib) mlz 

1 5 (ES) 663.0 (M+H) + ; HRMS (ES) m/z calcd. for CasHs^FNzOw (MH*) 663.2354, found 
663.2341. 

***************************** 

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
20 addition to those described herein will become apparent to those skilled in the art 
fit>m the foregoing description. Such modifications are intended to fall within the 
scope of the appended claims. 

Patents, patent applications, publications, product descriptions, 
Genbank Accession Numbers and protocols are cited throughout this application, the 
25 disclosures of which are incorporated herein by reference in their entireties for all 
purposes. 
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CLAIMS 

1 . A method for identifying a ligand of NPC1L1 comprising: 

contacting human NPC1L1 with a detectably labeled substituted 2-azetidinone 
glucuronide and a candidate compound; and 

determining whether said candidate compound binds to human NPC1L1 ; 
wherein binding of said candidate compound to human NPC1L1 modulates 
binding of said detectably labeled substituted 2-azetidinone glucuronide to human 
NPC1L1, wherein the detectably labeled substituted 2-azetidinone glucuronide has a 
binding affinity K D value for human NPC1L1 that is 200nM or lower, and wherein 
said modulation indicates that the candidate compound is a ligand that binds to human 
NPC1L1. 

2. The method of claim 1, wherein the Kd value is lOQnM or lower. 

3. The method of claim 1, wherein the Kd value is 50nM or lower. 

4. The method of claim 1, wherein the Kd value is 20nM or lower. 

5. The method of claim 1, wherein the Kd value is lOnM or lower. 

6. The method of claim 1, wherein the substituted 2-azetidinone-glucuronide 
is selected from the group consisting of a compound of Formula I and a compound of 
Formula II. 

7. The method of claim 6, wherein the substituted 2-azetidinone-glucuronide 
comprises a detectable label from the group consisting of 35 S and i25 I. 

8. The method of claim 7, wherein the detectable label is 33 S. 

9. The method of claim 6, wherein the substituted 2-azetidinone-glucuronide 
is a compound of Formula II, wherein R 9 comprises an -SO2- group. 

10. The method of claim 9, wherein the substituted 2-azetidinone-glucuronide 
of Formula II is labeled with 35 S. 
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1 1 . A method for identifying a ligand of NPC1L1 comprising: 

contacting human NPC1L1 with a detectably labeled substituted 2-azetidinone 
glucuronide of Formula II and a candidate compound; and 

determining whether said candidate compound binds to human NPC1L1 ; 
wherein binding of said candidate compound to human NPC1L1 modulates 
binding of said detectably labeled substituted 2-azetidinone glucuronide of Formula II 
to human NPC1L1, and wherein said modulation indicates that the candidate 
compound is a ligand that binds to human NPC1L1. 

12. The method of claim 11, wherein R 9 of the detectably labeled substituted 
2-azetidinone glucuronide of Formula II comprises an -SO2- group. 

13. The method of claim 1 1, wherein the detectably labeled substituted 2- 
azetidinone glucuronide of Formula II is labeled with 35 S. 

14. The method of claim 11, wherein the detectably labeled substituted 2- 
azetidinone glucuronide of Formula II has a binding affinity K D value for human 
NPC1L1 that is 200nM or lower. 

15. The method of claim 14, wherein the K D value is lOOnM or lower. 

16. The method of claim 14, wherein the K© value is 50nM or lower. 

1 7. The method of claim 14, wherein the Kd value is 20nM or lower. 

18. The method of claim 14, wherein the Kd value is lOnM or lower. 

19. The method of claim 1 wherein the detectably labeled substituted 2- 
azetidinone glucuronide is labeled with S. 

20. The method of claim 1 wherein the detectably labeled substituted 2- 
azetidinone glucuronide is 35 S-labeled compound 2. 
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FIGURE 12 
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FIGURE 13 
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FIGURE 14 
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FIGURE 15 
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SEQUENCE LISTING 
<110> Garcia-Calvo, Margarita 

<120> NPC1L1 (NPC3) AND METHODS OF IDENTIFYING LIGANDS THEREOF 

<130> 36134-PCT 074669.0111 

<140> To Be Assigned 

<141> 2005-01-14 

<150> US 60/537,341 

<151> 2004-01-16 

<160> 51 

<170> Patentln version 3.1 

<210> 1 

<211> 3996 

<212> DNA 

<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1)..(3996) 

<223> 

<400> 1 

atg gca get gec tgg ctg gga tgg ctg etc tgg gec ctg etc ctg age 

Met Ala Ala Ala Trp Leu Gly Trp Leu Leu Trp Ala Leu Leu Leu Ser 

1 5 10 15 

gcg gee cag ggt gag eta tac aca ccc aaa cac gaa get ggg gtc tgc 
Ala Ala Gin Gly Glu Leu Tyr Thr Pro Lys His Glu Ala Gly Val Cys 

20 25 30 

ace ttt tac gaa gag tgc ggg aaa aac cca gag etc tct gga ggc etc 
Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 



48 
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144 
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35 40 45 

acg tea eta tec aat gta tec tgc ctg tct aac ace ccg gee cgc cac 192 
Thr Ser Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 

gtc acg ggt gaa cac ctg get ctt etc cag cgc ate tgt ccc cgc ctg - 240 
Val Thr Gly Glu His Leu Ala Leu Leu Gin Arg He Cys Pro Arg Leu 
65 70 75 80 

tac aac ggc ccc aat ace act ttt gee tgt tgc tct ace aag cag ctg 288 
Tyr Asn Gly Pro Asn Thr Thr Phe Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 

ctg tec tta gaa age age atg tec ate ace aag gee ctt etc acg cgc 336 
Leu Ser Leu Glu Ser Ser Met Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 110 

tgc ccg gee tgc tct gac aat ttt gtg age t*ta cac tgc cac aac act 384 
Cys Pro Ala Cys Ser Asp Asn Phe Val Ser Leu His Cys His Asn Thr 
115 120 125 

tgc age cct gac cag age etc ttc ate aac g-tc ace egg gtg gtt gag 432 
Cys Ser Pro Asp Gin Ser Leu Phe He Asn Val Thr Arg Val Val Glu 
130 135 140 

egg ggc get gga gag cct cct gee gtg gtg gec tat gag gec ttt tat 4 80 

Arg Gly Ala Gly Glu Pro Pro Ala Val Val AJ.a Tyr Glu Ala Phe Tyr 
145 150 % 155 160 

cag cgc age ttt get gag aag gee tat gag tec tgc age cag gtg cgc 528 
Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Gin Val Arg 

165 170 175 

ate cct gcg gec get tec ttg gee gtg ggc age atg tgt gga gtg tat 57 6 

He Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 

180 185 190 

ggc tec gec etc tgc aat get cag cgc tgg etc aac ttc caa gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

aca ggg aat ggc ctg get ccg ctg gat ate acc ttc cac etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp He Trir Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gec eta ccg gat ggg ate cag cca ctg aat ggg aag ate 720 
Pro Gly Gin Ala Leu Pro Asp Gly He Gin Piro Leu Asn Gly Lys He 
225 230 235 240 

gca ccc tgc aac gag tct cag ggt gat gac tea gca gtc tgc tec tgc 768 
Ala Pro Cys Asn Glu Ser Gin Gly Asp Asp Ser Ala Val Cys Ser Cys 

245 250 255 

cag gac tgt gcg gcg tec tgc cct gtc ate cct ccg ccc gag gee ttg 816 
Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Glu Ala Leu 

260 265 270 

cgc cct tec ttc tac atg ggt cgc atg cca ggc tgg ctg gee etc ate 864 
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Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu lie 
275 280 285 

ate ate ttc act get gtc ttt gtg ttg etc tct gca gtc ctt gtg cgt 912 
lie lie Phe Thr Ala Val Phe Val Leu Leu Ser Ala Val Leu Val Arg 
290 295 300 

etc cga gtg gtt tec aac agg aac aag aac aag gca gaa ggc ccc cag 960 
Leu Arg Val Val Ser Asn Arg Asn Lys Asn Lys Ala Glu Gly Pro Gin 
305 310 315 320 

gaa gec ccc aaa etc cct cat aag cac aaa etc tea ccc cat ace ate 1008 
Glu Ala Pro Lys Leu Pro His Lys His Lys Leu Ser Pro His Thr lie 

325 330 335 

ctg ggc egg ttc ttc cag aac tgg ggc aca agg gtg gee teg tgg cca 1056 
Leu Gly Arg Phe Phe Gin Asn Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 345 " 350 

etc ace gtc tta gca ctg tec ttc ate gtt gtg ata gec tta gca gca 1104 
Leu Thr Val Leu Ala Leu Ser Phe lie Val Val lie Ala Leu Ala Ala 
355 360 365 

ggc ctg acc ttt att gaa etc acc aca gac cct gtg gaa ctg tgg teg 1152 
Gly Leu Thr Phe lie Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



gee ccc aag age cag gee egg aaa gag aag tct ttc cat gat gag cat 1200 
Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ser Phe His Asp Glu His ' 
385 390 395 400 

ttc ggc ccc ttc ttt cga acc aac cag att ttc gtg aca get egg aac ' 124 8 
Phe Gly Pro Phe Phe Arg Thr Asn Gin lie Phe Val Thr Ala Arg Asn 

405 410 415 

agg tec age tac aag tac gac tec eta ctg eta ggg tec aag aac ttc 1296 
Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Ser Lys Asn Phe 

420 425 430 

agt ggg ate ctg tec ctg gac ttc ctg ctg gag ctg ctg gag ctt cag 134 4 

Ser Gly lie Leu Ser Leu Asp Phe Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 

gag agg ctt cga cac ctg caa gtg tgg tec cct gag gca gag cgc aac 1392 
Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Glu Arg Asn 
450 455 460 

ate tec etc cag gac ate tgc tat gee ccc etc aac cca tat aac acc 1440 
lie Ser Leu Gin Asp lie Cys Tyr Ala Pro Leu Asn Pro Tyr Asn Thr 
465 470 475 480 

age etc tec gac tgc tgt gtc aac age etc ctt cag tac ttc cag aac 1488 
Ser Leu Ser Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 " 495 

aac cgc acc etc ctg atg etc acg gec aac cag act ctg aat ggc cag 1536 
Asn Arg Thr Leu Leu Met Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 

500 505 510 
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acc tec ctg gtg gac tgg aag gac cat ttc etc tac tgt gca aat gec 1584 
Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 

cct etc acg ttc aaa gat ggc acg tct ctg gec ctg age tgc atg get 1632 
Pro Leu Thr Phe Lys Asp Gly Thr Ser Leu Ala Leu Ser Cys Met Ala 
530 535 540 

gac tac ggg get cct gtc ttc ccc ttc ctt get gtt ggg gga -tac caa 1680 
Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
54 5 550 555 " 560 

ggc acg gac tat tec gag gca gaa gcg ctg ate ata acc ttc -tct etc 1728 
Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu He He Thr Phe Ser Leu 

565 570 575 

aat aac tac ccc get gat gat ccc cgc atg gec cag gec aag etc tgg 1776 
Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala Gin Ala Lys Leu Trp 

580 585 590 

gag gag get ttc ttg aag gaa atg gaa tec ttc cag agg aac stca agt 1824 
Glu Glu Ala Phe Leu Lys Glu Met Glu Ser Phe Gin Arg Asn Thr Ser 
595 600 605 

gac aag ttc cag gtt gcg ttc tea get gag cgc tct ctg gag cjat gag 1872 
Asp Lys Phe Gin Val Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 

ate aac cgc acc acc ate cag gac ctg cct gtc ttt gee gtc age tac 1920 
He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala Val Ser Tyr 
625 630 635 640 

att ate gtc ttc ctg tac ate tec ctg gee ctg ggc age tac tec aga 1968 
lie He Val Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Ara 

645 650 



tgc age cga gta gcg gtg gag tec aag get act ctg ggc eta ggt ggg 2016 
Cys Ser Arg Val Ala Val Glu Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 

gtg att gtt gtg ctg gga gca gtt ctg get gec atg ggc ttc tac tec 2064 
Val He Val Val Leu Gly Ala Val Leu Ala Ala Met Gly Phe Tyr Ser 
675 680 685 

tac ctg ggt gtc ccc tct tct ctg gtt ate ate caa gtg gta cct ttc 2112 
Tyr Leu Gly Val Pro Ser Ser Leu Val He He Gin Val Val Pro Phe 
690 695 700 

ctg gtg eta get gtg gga get gac aac ate ttc ate ttt gtt ctt gag 2160 
Leu Val Leu Ala Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 

tac cag agg eta cct agg atg cct ggg gaa cag cga gag get cac att 2208 
Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala Has He 

725 730 735 

ggc cgc acc ctg ggc agt gtg gee ccc age atg ctg ctg tgc acre etc 2256 
Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

74 ° 745 750 
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tct gag gcc ate tgc ttc ttt eta ggg gee ctg ace ccc atg cca get 2304 
Ser Glu Ala lie Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 

gtg agg ace ttc gcc ttg ace tct ggc tta gca att ate etc gac ttc 2352 
Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala lie lie Leu Asp Phe 
770 775 780 

ctg etc cag atg act gcc ttt gtg gcc ctg etc tec ctg gat age aag 2400 
Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 

agg cag gag gcc tct cgc ccg gat gtc tta tgc tgc ttt tea acc egg 2448 
Arg Gin Glu Ala Ser Arg Pro Asp Val Leu Cys Cys Phe Ser Thr Arg 

805 810 815 

aag ctg ccc cca cct aaa gaa aaa gaa ggc etc tta etc cgc ttc ttc 2496 
Lys Leu Pro Pro Pro Lys Glu Lys Glu Gly Leu Leu Leu Arg Phe Phe 

820 825 830 

cgc aag ata tac get cct ttc ctg ctg cac aga ttc ate cgc cct gtt 254 4 

Arg Lys lie Tyr Ala Pro Phe Leu Leu His Arg Phe lie Arg Pro Val 
835 840 845 

gtg atg ctg ctg ttt ctg acc ctg ttt gga gca aat etc tac tta atg 2592 
Val Met Leu Leu Phe Leu Thr Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

tgc aac ate aac gtg ggg eta gac cag gag ctg get ctg ccc aag gac 264 0 

Cys Asn lie Asn Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 

■ 

teg tac ttg ata gac tac ttc etc ttt ctg aac cga tac ctt gaa gtg 2688 
Ser Tyr Leu lie Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 

ggg cct cca gtg tac ttt gtc acc acc teg ggc ttc aac ttc tec age 2736 
Gly Pro Pro Val Tyr Phe Val Thr Thr Ser Gly Phe Asn Phe Ser Ser 

900 905 910 

gag gca ggc atg aac gcc act tgc tct age gca ggc tgt aag age ttc 2784 
Glu Ala Gly Met Asn Ala Thr Cys Ser Ser Ala Gly Cys Lys Ser Phe 
915 920 925 

tec eta acc cag aaa ate cag tat gcc agt gaa ttc cct gac cag tct 2832 
Ser Leu Thr Gin Lys lie Gin Tyr Ala Ser Glu Phe Pro Asp Gin Ser 
930 935 940 



2880 



tac gtg get att get gca tec tec tgg gta gat gac ttc ate gac tgg 
Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

ctg acc ccg tec tec tec tgc tgt cgc ctt tat ata cgt ggc ccc cat 2928 
Leu Thr Pro Ser Ser Ser Cys Cys Arg Leu Tyr He Arg Gly Pxro His 

965 970 " 975 

aag gat gag ttc tgt ccc tea acg gat act tec ttc aac tgc tta aaa 2976 
Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 
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980 



985 



990 



aac tgc atg aac cgc act ctg ggt cct gtg agg ccc aca gcg gaa cag 
Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Ala Glu Gin 
995 1000 1005 



3024 



ttt cat aag tac ctg ccc tgg ttc ctg aat gat ccg ccc aat ate 
Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn He 
1010 1015 1020 



3069 



aga tgt ccc aaa ggg ggt eta gca gcg tat aga acg tct gtg aat 
Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 

ttg age tea gat ggc cag gtt ata gec tec cag ttc atg gee tac 
Leu Ser Ser Asp Gly Gin Val He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



3114 



3159 



cac aag ccc tta agg aac tea cag gac ttc aca gaa get etc egg 
His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 

gcg tec egg ttg eta gca gee aac ate aca get gac eta egg aag 
Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys 
1070 1075 1080 

gtg cct ggg aca gat cca aac ttt gag. gtc ttc cct tac acg ate 
Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 

tec aac gtg ttc tac cag caa tac ctg acg gtc ctt cct gag gga 
Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 

ate ttc ace ctt get ctt tgc ttt gtg ccc ace ttt gtt gtc tgc 
He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 

tac etc eta ctg ggc ctg gac atg tgc tea ggg ate etc aac eta 
Tyr Leu Leu Leu Gly Leu Asp Met Cys Ser Gly He Leu Asn Leu 
1130 1135 1140 



3204 



etc tec ate att atg att 
Leu Ser He He Met He 
1145 



etc gtg gac ace att ggc etc atg get 
Leu Val Asp Thr He Gly Leu Met Ala 
1150 1155 



gtg tgg ggt ate age tat aat gcg gta tec etc ate aac ctt gtc 

Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 

acg gca gtg ggc atg tct gtg gag ttt gtg tec cac ate act egg 

Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 

tec ttt get gta age acc aag cct ace egg ctg gag agg get aaa 

Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



3249 



3234 



3339 



338 4 



3429 



347 4 



351 9 



3564 



360 9 



gat get act gtc ttc atg ggc agt gcg gtg ttt get gga gtg gec 



365 4 
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Asp Ala Thr Val Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 

atg acc aac ttc cca ggc ate etc ate ttg ggc ttt gee caa gee 3699 
Met Thr Asn Phe Pro Gly lie Leu lie Leu Gly Phe Ala Gin Ala 
1220 1225 1230 

cag ctt att cag ate ttc ttc ttc cgc etc aac ctt ctg ate acc 374 4 

Gin Leu lie Gin lie Phe Phe Phe Arg Leu Asn Leu Leu lie Thr 
1235 1240 1245 

ttg ctg ggt ctg ctg cat ggc ctg gtc ttc ctg ccg gtt gtc etc 3789 
Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 

age tat ctg gga cca gat gtt aac caa get ctg gta cag gag gag 3834 
Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Gin Glu Glu 
1265 1270 1275 

aaa eta gee age gag gca gca gtg gee cca gag cct tct tgc cca 3879 
Lys Leu Ala Ser Glu Ala Ala Val Ala Pro Glu Pro Ser Cys Pro 
1280 1285 1290 

cag tac ccc tec cct get gat gcg gat gee aat gtt aac tac ggc 3924 
Gin Tyr Pro Ser Pro Ala Asp Ala Asp Ala Asn Val Asn Tyr Gly 
1295 1300 1305 

ttt gec cca gaa ctt gec cac gga get aat get get aga age tct 3969 
Phe Ala Pro Glu Leu Ala His Gly Ala Asn Ala Ala Arg Ser Ser 
1310 1315 1320 

ttg ccc aaa agt gac caa aag ttc taa 3996 
Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 

<210> 2 

<211> 1331 

<212> PRT 

<213> Rattus sp. 



<400> 2 

Met Ala Ala Ala Trp Leu Gly Trp Leu Leu Trp Ala Leu Leu Leu Ser 
15 10 15 

Ala Ala Gin Gly Glu Leu Tyr Thr Pro Lys His Glu Ala Gly Val Cys 

20 25 30 

Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 
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Thr Ser Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 



Val Thr Gly Glu His Leu Ala Leu Leu Gin Arg lie Cys Pro Arg Leu 
65 70 75 80 



Tyr Asn Gly Pro Asn Thr Thr Phe Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 



Leu Ser Leu Glu Ser Ser Met Ser lie Thr Lys Ala Leu Leu Thr Arg 

100 105 110 



Cys Pro Ala Cys Ser Asp Asn Phe Val Ser Leu His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asp Gin Ser Leu Phe lie Asn Val Thr Arg Val Val Glu 
130 135 140 



Arg Gly Ala Gly Glu Pro Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Gin Val Arg 

165 170 175 

lie Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 

180 185 190 

Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

Pro Gly Gin Ala Leu Pro Asp Gly He Gin Pro Leu Asn Gly Lys He 
225 230 235 240 

Ala Pro Cys Asn Glu Ser Gin Gly Asp Asp Ser Ala Val Cys Ser Cys 

245 250 255 

■ 

Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Glu Ala Leu 

260 265 270 



Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 
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He He Phe Thr Ala Val Phe Val Leu Leu Ser Ala Val Leu Val Arg 
290 295 300 

Leu Arg Val Val Ser Asn Arg Asn Lys Asn Lys Ala Glu Gly Pro Gin 
305 310 315 320 

Glu Ala Pro Lys Leu Pro His Lys His Lys Leu Ser Pro His Thr He 

325 330 335 

Leu Gly Arg Phe Phe Gin Asn Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 • 345 350 

Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ala Ala 
355 360 365 

Gly Leu Thr Phe He Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 

Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ser Phe His Asp Glu His 
385 390 395 400 

Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Arg Asn 

405 410 415 

Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Ser Lys Asn Phe 

420 425 430 

Ser Gly He Leu Ser Leu Asp Phe Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 

Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Glu Arg JVsn 
450 455 4 60 

He Ser Leu Gin Asp He Cys Tyr Ala Pro. Leu Asn Pro Tyr Asn Thr 
465 470 475 480 

Ser Leu Ser Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin -Asn 

485 490 495 

Asn Arg Thr Leu Leu Met Leu Thr Ala Asn Gin Thr Leu Asn Gly «31n 

500 505 510 



Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
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515 



520 



525 



Pro Leu Thr Phe Lys Asp Gly Thr Ser Leu Ala Leu Ser Cys Met Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 



Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu lie He Thr Phe Ser Leu 



Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala Gin Ala Lys Leu Trp 

580 585 590 



Glu Glu Ala Phe Leu Lys Glu Met Glu Ser Phe Gin Arg Asn Thr Ser 
595 600 605 



Asp Lys Phe Gin Val Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala Val Ser Tyr 
625 630 635 640 



He He Val Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Arg 

645 650 655 



Cys Ser Arg Val Ala Val Glu Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 



Val He Val Val Leu Gly Ala Val Leu Ala Ala Met Gly Phe Tyr Ser 
675 680 685 



Tyr Leu Gly Val Pro Ser Ser Leu Val lie lie Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ala Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala His He 



565 



570 



575 



725 



730 



735 



Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 
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Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 

Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala He He Leu Asp Phe 
770 775 780 

Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 7 90 795 800 

Arg Gin Glu Ala Ser Arg Pro Asp Val Leu Cys Cys Phe Ser Thr Arg 

805 810 815 

Lys Leu Pro Pro Pro Lys Glu Lys Glu Gly Leu Leu Leu Arg Phe Phe 

820 825 830 

Arg Lys He Tyr Ala Pro Phe Leu Leu His Arg Phe He Arg Pro Val 
835 840 845 

Val Met Leu Leu Phe Leu Thr Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

Cys Asn He Asn Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 

Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 " 895 

Gly Pro Pro Val Tyr Phe Val Thr Thr Ser Gly Phe Asn Phe Ser Ser 

900 905 910 

Glu Ala Gly Met Asn Ala Thr Cys Ser Ser Ala Gly Cys Lys Ser P*>e 
915 920 925 

Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asp Gin Ser 
930 935 940 

Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
y4:> 95 ° 955 960 

Leu Thr Pro Ser Ser Ser Cys Cys Arg Leu Tyr He Arg Gly Pro His 

965 97 0 975 



Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 985 9 £ Q 
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Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Ala Glu Gin 
995 1000 1005 

Phe Lys Tyr Leu Pro Tr P phe Leu Asn A SP p ro Pro Asn lie 

1010 1015 1020 

Ar9 ?£L Pr ° Lys G1 y G1 y Leu Ala Ala Arg Thr Ser Val Asn 
102 S 1030 1035 

LeU ?«f« Ser Asp Gly Gln Val Ile A 13 Ser Gin Phe Met Ala Tyr 
1040 1045 1050 

H1S c Pr ° LeU Arg Asn Ser Gln AsD Phe Thr Glu Ala Leu Arg 
1055 1060 1065 

?J!?n Arg LeU Leu Ala Ala Ile Thr Ala Asp Leu Arg Lys 

xu/0 1075 1080 

Val Gly Thr As P Pro Asn Phe Glu Val Phe Pro Tyr Thr Ile 

1085 1090 1095 

nn, Val Phe Tyr Gln Gln ^r. Leu Thr Val Leu Pro Glu Gly 
1100 1105 mo 

Ile Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 H25 

^ ???n LeU LeU Gly Leu Asp Met Cys Ser G1 y I>eu Asn Leu 

aiJ0 1135 . H40 

Leu Ser He He Met lie Leu Val Asp Thr Ile Gly Leu Met Ala 
1145 1150 H55 

Val ?f?n Gly Ile Ser Tyr Asn Ala Val Ser Leu Ile Asn Leu Val 
1160 H65 H70 

Thr A J a Val G ±y Met Ser Val Glu Phe Val Ser His Ile Thr Arg 
1175 1180 H85 

^ nSn AIa Val Ser Thr Lys Pro Thr Leu G ^ Arg Ala Lys 
1190 1195 1200 

ASP $ln* Thr Val Phe Met Glv Ser Ala Val Phe Ala Gly Val Ala 
a ^ 05 1210 1215 
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Met Thr Asn Phe Pro Gly lie Leu lie Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



Gin Leu lie Gin lie Phe Phe Phe Arg Leu Asn Leu Leu lie Thr 
1235 1240 1245 



Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Gin Glu Glu 
1265 1270 1275 



Lys Leu Ala Ser Glu Ala Ala Val Ala Pro Glu Pro Ser Cys Pro 
1280 1285 1290 



Gin Tyr Pro Ser Pro Ala Asp Ala Asp Ala Asn Val Asn Tyr Gly 
1295 1300 1305 



Phe Ala Pro Glu Leu Ala His Gly Ala Asn Ala Ala Arg Ser Ser 
1310 1315 1320 



Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



<210> 3 

<211> 3999 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1)..(3999) 

<223> 



<400> 3 

atg gcg gag gcc ggc ctg agg ggc tgg ctg ctg tgg gcc ctg etc ctg 48 

Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
1 5 10 15 
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cgc ttg gcc cag agt gag cct tac aca acc ate cac cag cct ggc tac 96 
Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr lie His Gin Pro Gly Tyr 

20 25 30 

tgc gcc ttc tat gac gaa tgt ggg aag aac cca gag ctg tct gga age 144 
Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 

etc atg aca etc tec aac gtg tec tgc ctg tec aac acg ccg gcc cgc 192 
Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 

aag ate aca ggt gat cac ctg ate eta tta cag aag ate tgc ccc cgc 24 0 

Lys He Thr Gly Asp His Leu He Leu Leu Gin Lys He Cys Pro Arg 
65 70 75 80 

etc tac acc ggc ccc aac acc caa gcc tgc tgc tec gcc aag cag ctg 288 
Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 

85 90 95 

gta tea ctg gaa gcg agt ctg teg ate acc aag gcc etc etc acc cgc 336 
Val Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 110 

tgc cca gcc tgc tct gac aat ttt gtg aac ctg cac tgc cac aac acg 384 
Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 

tgc age ccc aat cag age etc ttc ate aat gtg acc cgc gtg gcc cag 432 
Cys Ser Pro Asn Gin Ser Leu Phe He Asn Val Thr Arg Val Ala Gin 
130 135 140 

eta ggg get gga caa etc cca get gtg gtg gcc tat gag gcc ttc tac 4 80 

Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

cag cat age ttt gcc gag cag age tat gac tec tgc age cgt gtg cgc 528 
Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 

165 170 175 

gtc cct gca get gcc acg ctg get gtg ggc acc atg tgt ggc gtg tat 576 
Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 

180 185 190 

ggc tct gcc ctt tgc aat gcc cag cgc tgg etc aac ttc cag gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 * 205 

aca ggc aat ggt ctg gcc cca ctg gac ate acc ttc cac etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gcc gtg ggg agt ggg att cag cct ctg aat gag ggg gtt 720 
Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 

gca cgt tgc aat gag tec caa ggt gac gac gtg gcg acc tgc tec tgc 768 
Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 

245 250 255 
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caa gac tgt get gca tec tgt cct gee ata gee cgc ccc cag gee etc 816 
Gin Asp Cys Ala Ala Ser Cys Pro Ala lie Ala Arg Pro Gin Ala Leu 

260 265 270 

gac tec ace ttc tac ctg ggc cag atg ccg ggc agt ctg gtc etc ate 864 
Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu lie 
275 280 285 

ate ate etc tgc tct gtc ttc get gtg gtc acc ate ctg ctt gtg gga 912 
He He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly 
290 295 300 

ttc cgt gtg gec ccc gec agg gac aaa age aag atg gtg gac ccc aag 960 
Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 

aag ggc acc age etc tct gac aag etc age ttc tec acc cac acc etc 1008 
Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 

325 330 335 

ctt ggc cag ttc ttc cag ggc tgg ggc acg tgg gtg get teg tgg cct 1056 
Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 

340 345 350 

ctg acc ate ttg gtg eta tct gtc ate ccg gtg gtg gee ttg gca gcg 1104 
Leu Thr He Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala 
355 360 365 

ggc ctg gtc ttt aca gaa etc act acg gac ccc gtg gag ctg tgg teg 1152 
Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 

gee ccc aac age caa gec egg agt gag aaa get ttc cat gac cag cat 1200 
Ala Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His 
385 390 395 400 

ttc ggc ccc ttc ttc cga acc aac cag gtg ate ctg acg get cct aac 124 8 

Phe Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn 

405 410 41S 

egg tec age tac agg tat gac tct ctg ctg ctg ggg ccc aag aac ttc 1296 
Arg Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 

age gga ate ctg gac ctg gac ttg ctg ctg gag ctg eta gag ctg cag 1344 
Ser Gly He Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 

gag agg ctg egg cac etc cag gta tgg teg ccc gaa gca cag cgc aac 1392 
Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn 
450 455 460 

ate tec ctg cag gac ate tgc tac gec ccc etc aat ccg gac aat acc 14 40 

He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr 
465 470 475 480 

agt etc tac gac tgc tgc ate aac age etc ctg cag tat ttc cag aac 1488 
Ser Leu Tyr Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn 
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485 490 495 

aac cgc acg etc ctg ctg etc aca gec aac cag aca ctg atg ggg cag 1536 
Asn Arg Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin 

500 505 510 

acc tec caa gtc gac tgg aag gac cat ttt ctg tac tgt gec aat gee 1584 
Thr Ser Gin Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 

ccg etc acc ttc aag gat ggc aca gec ctg gee ctg age tgc atg get 1632 
Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys Met Ala 
530 535 540 

gac tac ggg gee cct gtc ttc ccc ttc ctt gec att ggg ggg tac aaa 1680 
Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala lie Gly Gly Tyr Lys 
545 550 555 560 

gga aag gac tat tct gag gca gag gec ctg ate atg acg ttc tec etc 1728 
Gly Lys Asp Tyr Ser Glu Ala Glu Ala Leu lie Met Thr Phe Ser Leu 

565 570 575 

aac aat tac cct gec ggg gac ccc cgt ctg gee cag gee aag ctg tgg 1776 
Asn Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp 

580 585 590 

gag gag gee ttc tta gag gaa atg cga gec ttc cag cgt egg atg get 1824 
Glu Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala 
595 600 605 

ggc atg ttc cag gtc acg ttc acg get gag cgc tct ctg gaa gac gag 1872 
Gly Met Phe Gin Val Thr Phe Thr Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 

ate aat cgc acc aca get gaa gac ctg ccc ate ttt\ gec acc age tac 1920 
lie Asn Arg Thr Thr Ala Glu Asp Leu Pro lie Phe* Ala Thr Ser Tyr 
625 630 635 640 

att gtc ata ttc ctg tac ate tct ctg gec ctg ggc age tat tec age 1968 
lie Val lie Phe Leu Tyr lie Ser Leu Ala Leu Gly Ser Tyr Ser Ser 

645 650 655 

tgg age cga gtg atg gtg gac tec aag gee acg ctg ggc etc ggc ggg 2016 
Trp Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 

gtg gee gtg gtc ctg gga gca gtc atg get gec atg ggc ttc ttc tec 2064 
Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
675 680 685 

tac ttg ggt ate cgc tec tec ctg gtc ate ctg caa gtg gtt cct ttc 2112 
Tyr Leu Gly He Arg Ser Ser Leu Val He Leu Gin Val Val Pro Phe 
690 695 700 

ctg gtg ctg tec gtg ggg get gat aac ate ttc ate ttt gtt etc gag 2160 
Leu Val Leu Ser Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
7 05 710 715 720 

tac cag agg ctg ccc egg agg cct ggg gag cca cga gag gtc cac att 2208 
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Tyr Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His He 

725 730 735 

ggg cga gcc eta ggc agg gtg get ccc age atg ctg ttg tgc age etc 2256 
Gly Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 

tct gag gcc ate tgc ttc ttc eta ggg gcc ctg acc ccc atg cca get 2304 
Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 

gtg egg acc ttt gcc ctg acc tct ggc ctt gca gtg ate ctt gac ttc 2352 
Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe 
770 775 780 

etc ctg cag atg tea gcc ttt gtg gcc ctg etc tec ctg gac age aag 2400 
Leu Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 

agg cag gag gcc tec egg ttg gac gtc tgc tgc tgt gtc aag ccc cag 2448 
Arg Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin 

805 810 815 

gag ctg ccc ccg cct ggc cag gga gag ggg etc ctg ctt ggc ttc ttc 2496 
Glu Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe 

820 825 830 

caa aag get tat gcc ccc ttc ctg ctg cac tgg ate act cga ggt gtt 254 4 

Gin Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val 
835 840 * 845 

gtg ctg ctg ctg ttt etc gcc ctg ttc gga gtg age etc tac tec atg 2592 
Val Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met 
850 855 860 

tgc cac ate age gtg gga ctg gac cag gag ctg gcc ctg ccc aag gac 2640 
Cys His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 " 880 

* 

teg tac ctg ctt gac tat ttc etc ttt ctg aac cgc tac ttc gag gtg 2688 
Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 

885 890 " 895 

ggg gcc ccg gtg tac ttt gtt acc acc ttg ggc tac aac ttc tec age 2736 
Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 

900 905 910 

gag get ggg atg aat gcc ate tgc tec agt gca ggc tgc aac aac- ttc 2784 
Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 

tec ttc acc cag aag ate cag tat gcc aca gag ttc cct gag cag tct 2832 
Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 

tac ctg gcc ate cct gcc tec tec tgg gtg gat gac ttc att gac tgg 2880 
Tyr Leu Ala He Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
94 5 950 955 960 
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ctg acc ccg tec tec tgc tgc cgc ctt tat ata tct ggc ccc aat aag 2928 
Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 

965 970 975 

gac aag ttc tgc ccc teg acc gtc aac tct ctg aac tgc eta aag aac 2976 
Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 

980 985 990 

tgc atg age ate acg atg ggc tct gtg agg ccc teg gtg gag cag ttc 3024 
Cys Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 

cat aag tat ctt ccc tgg ttc ctg aac gac egg ccc aac ate aaa 3069 
His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn He Lys 
1010 1015 1020 

tgt ccc aaa ggc ggc ctg gca gca tac age acc tct gtg aac ttg 3114 
Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 

act tea gat ggc cag gtt tta gec tec agg ttc atg gee tat cac 3159 
Thr Ser Asp Gly Gin Val Leu Ala Ser Arg Phe Met Ala Tyr His 
1040 1045 1050 

aag ccc ctg aaa aac tea cag gat tac aca gaa get ctg egg gca 3204 
Lys Pro Leu Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala 
1055 1060 1065 

get cga gag ctg gca gec aac ate act get gac ctg egg aaa gtg 3249 
Ala Arg Glu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val 
1070 1075 1080 

cct gga aca gac ccg get ttt gag gtc ttc ccc tac acg ate acc 3294 
Pro Gly Thr Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr 
1085 1090 1095 

aat gtg ttt tat gag cag tac ctg acc ate etc cct gag ggg etc 3339 
Asn Val Phe Tyr Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu 
1100 1105 1110 

ttc atg etc age etc tgc ctt gtg ccc acc ttc get gtc tec tgc 3384 
Phe Met Leu Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys 
1115 1120 1125 

etc ctg ctg ggc ctg gac ctg cgc tec ggc etc etc aac ctg etc 3429 
Leu Leu Leu Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu 
1130 H35 H40 

tec att gtc atg ate etc gtg gac act gtc ggc ttc atg gee ctg 3474 
Ser He Val Met He Leu Val Asp Thr Val Gly Phe Met Ala Leu 
1145 H50 H55 

tgg gac ate agt tac aat get gtg tec etc ate aac ctg gtc teg 3519 
Trp Asp He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser 
1160 H65 H70 

gcg gtg ggc atg tct gtg gag ttt gtg tec cac att acc cgc tec 3564 
Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg Ser 
1175 H80 H85 
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ttt gcc ate age ace aag ccc ace tgg ctg gag agg gee aaa gag 
Phe Ala lie Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu 
1190 1195 1200 



3609 



gcc acc ate tct atg gga agt gcg gtg ttt gca ggt gtg gcc atg 
Ala Thr lie Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met 
1205 1210 1215 



3654 



acc aac ctg cct ggc ate ctt gtc ctg ggc etc gcc aag gcc cag 
Thr Asn Leu Pro Gly lie Leu Val Leu Gly Leu Ala Lys Ala Gin 
1220 1225 1230 



3699 



etc att cag ate ttc ttc ttc cgc etc aac etc ctg ate act ctg 
Leu lie Gin lie Phe Phe Phe Arg Leu Asn Leu Leu lie Thr Leu 
1235 1240 1245 



3744 



ctg ggc ctg ctg cat ggc ttg gtc ttc ctg ccc gtc ate etc age 
Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val lie Leu Ser 
1250 1255 1260 



3789 



tac gtg ggg cct gac gtt aac ccg get ctg gca ctg gag cag aag 
Tyr Val Gly Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys 
1265 1270 1275 



3834 



c 99 get gag gag gcg gtg gca gca gtc atg gtg gcc tct tgc cca 
Arg Ala Glu Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro 
1280 1285 1290 



3879 



aat cac ccc tec cga gtc tec aca get gac aac ate tat gtc aac 
Asn His Pro Ser Arg Val Ser Thr Ala Asp Asn lie Tyr Val Asn 
1295 1300 1305 



3924 



cac age ttt gaa ggt tct ate aaa ggt get ggt gcc ate age aac 

His Ser Phe Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn 
1310 1315 1320 

ttc ttg ccc aac aat ggg egg cag ttc tga 

Phe Leu Pro Asn Asn Gly Arg Gin Phe 
1325 1330 



3969 



3999 



<210> 4 

<211> 1332 

<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
1 5 10 15 



Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr He His Gin Pro Gly Tyr 
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20 25 30 

Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 

Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 

Lys lie Thr Gly Asp His Leu lie Leu Leu Gin Lys lie Cys Pro Arg 
65 70 75 80 

Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 

85 90 ~ 95 

Val Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 " HO 

Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 

Cys Ser Pro Asn Gin Ser Leu Phe lie Asn Val Thr Arg Val Ala Gin 
130 135 140 

Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 . 160 

Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 

165 170 175 

Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 

180 185 190 

GLy Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

Thir Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 

Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 



Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 

245 250 " 255 
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Gin Asp Cys Ala Ala Ser Cys Pro Ala lie Ala Arg Pro Gin Ala Leu 

260 265 270 

Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu lie 
275 280 285 

lie lie Leu Cys Ser Val Phe Ala Val Val Thr lie Leu Leu Val Gly 
290 295 300 

Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 

Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 

325 330 335 

Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 

340 345 350 

Leu Thr lie Leu Val Leu Ser Val lie Pro Val Val Ala Leu Ala Ala 
355 360 365 

Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 

Ala Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His 
385 390 395 400 

Phe Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn 

405 410 415 

Arg Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 

Ser Gly lie Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 

Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn 
450 455 460 

He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr 
46 5 470 475 480 



Ser Leu Tyr Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 
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Asn Arg Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin 

500 505 510 

Thr Ser Gin Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 " 525 



Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys Met Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala lie Gly Gly Tyr Lys 
545 550 555 " 560 

> 

Gly Lys Asp Tyr Ser Glu Ala Glu Ala Leu lie Met Thr Phe Ser Leu 

565 570 575 

Asn Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp 

580 585 590 

Glu Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala 
595 600 605 

Gly Met Phe Gin Val Thr Phe Thr Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 

lie Asn Arg Thr Thr Ala Glu Asp Leu Pro He Phe Ala Thr Ser Tyr 
625 630 635 640 

He Val He Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Ser 

645 650 655 

Trp Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 

Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
675 680 . 685 

Tyr Leu Gly He Arg Ser Ser Leu Val He Leu Gin Val Val Pro Phe 
690 695 700 

Leu Val Leu Ser Val Gly Ala Asp Asn He Phe lie Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His He 

725 730 " 735 
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Gly Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 " " 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe 
770 775 " 780 



Leu Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 



Arg Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin 

805 810 815 



Glu Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe 

820 825 830 



Gin Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val 
835 840 845 



Val Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met 
850 855 860 



Cys His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 

Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 

885 890 895 

Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 

900 905 910 

Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 

Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 

Tyr Leu Ala He Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 
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965 970 975 

Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 

980 985 990 

Cys Met Ser lie Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 

His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn lie Lys 
1010 1015 1020 

Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 

Thr Ser Asp Gly Gin Val Leu Ala Ser Arg Phe Met Ala Tyr H±3 
1040 1045 1050 

Lys Pro Leu Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala 
1055 1060 1065 

Ala Arg Glu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val 
1070 1075 1080 

Pro Gly Thr Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr 
1085 1090 1095 

Asn Val Phe Tyr Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu 
1100 1105 1110 

Phe Met Leu Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys 
1115 1120 1125 

■ 

Leu Leu Leu Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu 
1130 1135 1140 

Ser He Val Met He Leu Val Asp Thr Val Gly Phe Met Ala Leu 
1145 1150 H55 

Trp Asp He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser 
1160 1165 1170 

Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg Ser 
1175 1180 H85 
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Phe Ala He Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu 
1190 3.195 1200 

Ala Thr He Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met 
1205 L210 1215 

Thr Asn Leu Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin 
1220 1225 1230 

Leu He Gin He Phe Phe Prie Arg Leu Asn Leu Leu He Thr Leu 
1235 1240 1245 

Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser 
1250 1255 1260 

Tyr Val Gly Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys 
1265 1270 1275 

Arg Ala Glu Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro 
1280 1285 1290 

Asn His Pro Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn 
1295 1300 1305 

His Ser Phe Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn 
1310 1315 1320 

Phe Leu Pro Asn Asn Gly Ajrg Gin Phe 
1325 " 1330 

<210> 5 

<211> 885 

<212> DNA 

<213> Rattus sp. 



<400> 5 

ccacgcgtcc gcacctgcaa gtgtggtccc ctgaggcaga gcgcaacatc tccctccagg 60 

acatctgcta tgcccccctc aacccatata acaccagcct ctccgactgc tgtgtcaaca 120 

gcctccttca gtacttccag aacaa.ccgca ccctcctgat gctcacggcc aaccagactc 180 

tgaatggcca gacctccctg gtggactgga aggaccattt cctctactgt gcaaatgccc 240 
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caaagatggc acgtctctgg 


ccctgagctg 


catggctgac 


<_ a *^y y y y Vi» w 


300 


a ^ ^ /*> ^ *m 

Cuy uCutCCC 


cttccttgct gttgggggat 


accaaggcac 


ggactattcc 


yayyv^ctyocvy 


360 


cgctgatcat 


aaccttctct ctcaataact 


accccgctga 


tgatccccgc 


atggcccagg 


420 


ccaagctctg 


ggaggaggct ttcttgaagg 


aaatggaatc cttccagagg 


aacacaagtg 


480 


acaagttcca 


ggttgcgttc tcagctgagc 


gctctctgga 


ggatgagatc 


aaccgcacca 


540 


ccatccagga 


cctgcctgtc tttgccgtca 


gctacattat 


cgtcttcctg 


tacatctccc 


600 


tggccctggg 


cagctactcc agatgcagcc 


gagtagcggt 


ggagtccaag 


gctactctgg 


660 


gcctaggtgg 


ggtgatagtg tgctgggagc 


agttctggct 


tgcatggggc 


ttctaactcc 


720 


tacctgggtg 


tcccctcttc tctggttatc atccaagtgg 


tacctttcct 


ggtgcttaag 


780 


ctgtgggagc 


tggacacatc tacatcctag 


acttgagtac 


cagaggtacc 


taggaagccg 


840 


cggaacagcg 


aaaaggacac attgggcgca 


ccctgggcat 


gtggc 




885 



<210> 6 

<211> 458 

<212> DNA 

<213> Rattus sp. 



<400> 6 
gaccagatgt 


taaccaagct 


ctggtacagg aggagaaact 


agccagcgag 


gcagcagtgg 


60 


ccccagagcc 


ttcttgccca 


cagtacccct cccctgctga 


tgcggatgcc 


aatgttaact 


120 


acggctttgc 


cccagaactt 


gcccacggag ctaatgctgc tagaagctct 


ttgcccaaaa 


180 


gtgaccaaaa 


gttctaatgg 


agtaggagct tgtccatgct 


tctgctgatg agggatcatg 


240 


aaggtcttcc 


ctctggttgt 


cctcaaggcc tggggggagg 


ttgttcagag 


aaaaatggct 


300 


ggcattcctg 


ccacgaggca 


accggcagct tggcactgac 


tccttggtct 


cataggtccc 


360 


taaggcttgg 


tcagattact 


cctcatggag agactatctt 


aagtatctaa 


gctatcgatt 


420 


gggatgcatc 


gctgttcatt 


aaaaaggcta tggctatg 






458 



<210> 7 

<211> 896 

<212> DNA 

<213> Rattus sp. 
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<400> 7 

ccacgcgtcc gcagtttcat aagtacctgc cctggttcct gaatgatccg cccaatatca 60 

gatgtcccaa agggggtcta gcagcgtata gaacgtctgt gaatttgagc tcagatggcc 120 

aggttatagc ctcccagttc atggcctacc acaagccctt aaggaactca caggacttca 180 

cagaagctct ccgggcgtcc cggttgctag cagccaacat cacagctgac ctacggaagg 240 

tgcctgggac agatccaaac tttgaggtct tcccttacac gatctccaac gtgttctacc 300 

agcaatacct gacggtcctt cctgagggaa tcttzcaccct tgctctttgc tttgtgccca 360 

cctttgttgt ctgctacctc ctactgggcc tggacatgtg ctcagggatc ctcaacctac 420 

tctccatcat tatgattctc gtggacacca ttggrcctcat ggctgtgtgg ggtatcagct 480 

ataatgcggt atccctcatc aaccttgtca cggcagtggg catgtctgtg gagtttgtgt 540 

cccacatcac tcggtccttt gcttgtaagc accaagccta cccggctgga gagggctaaa 600 

agatgctact gtcttcatgg gcagtgcggt gtttgctgga gtggccatga ccaacttccc 660 

aggcatcctc atcttggggg ctttgcccca agcccaggct tattcagatc ttcttcttcc 720 

gcctcaacct tctgatcacc tttgctgggg tctgctgcat ggctggtctt cctgcccggt 780 

ttgtcctcag ctatctggga ccagatgtaa ccaaggctct gctacccgga ggagaaacta 840 

gccagcgagg gcagcagtgg ccccagagac ttcttzgccca caagtaccct tccctg 896 

<210> 8 
<211> 3124 
<212> DNA 
<213> Rattus sp. 

<400> 8 

tgcaagtgtg gtcccctgag gcagagcgca acatctccct ccaggacatc tgctatgccc 60 

ccctcaaccc atataacacc agcctctccg actgotgtgt caacagcctc cttcagtact 120 

tccagaacaa ccgcaccctc ctgatgctca cggccaacca gactctgaat ggccagacct 180 

ccctggtgga ctggaaggac catttcctct actg~bgcaaa tgcccctctc acgttcaaag 240 

atggcacgtc tctggccctg agctgcatgg ctgactacgg ggctcctgtc ttccccttcc 300 

ttgctgttgg gggataccaa ggcacggact attccgaggc agaagcgctg atcataacct 360 

tctctctcaa taactacccc gctgatgatc cccgcatggc ccaggccaag ctctgggagg 420 



WO 2005/069900 



28/78 



PCT/US2005/001469 



aggctttctt gaaggaaatg gaatccttcc agaggaacac aagtgacaag ttccaggttg 480 

cgttctcagc tgagcgctct ctggaggatg agatcaaccg caccaccatc caggacctgc 540 

ctgtctttgc cgtcagctac attatcgtct tcctgtacat ctccctggcc ctgggcagct 600 

actccagatg cagccgagta gcggtggagt ccaaggctac tctgggccta ggtggggtga 660 

ttgttgtgct gggagcagtt ctggctgcca tgggcttcta ctcctacctg ggtgtcccct 720 

cttctctggt tatcatccaa gtggtacctt tcctggtgct agctgtggga gctgacaaca 780 

tcttcatctt tgttcttgag taccagaggc tacctagga*t gcctggggaa cagcgagagg 840 

ctcacattgg ccgcaccctg ggcagtgtgg cccccagca-t gctgctgtgc agcctctctg 900 

aggccatctg cttctttcta ggggccctga cccccatgcc agctgtgagg accttcgcct 960 

tgacctctgg cttagcaatt atcctcgact tcctgctcca gatgactgcc tttgtggccc 1020 

tgctctccct ggatagcaag aggcaggagg cctctcgccc ggatgtctta tgctgctttt 1080 

caacccggaa gctgccccca cctaaagaaa aagaaggcct: cttactccgc ttcttccgca 1140 

agatatacgc tcctttcctg ctgcacagat tcatccgccc tgttgtgatg ctgctgtttc 1200 

tgaccctgtt tggagcaaat ctctacttaa tgtgcaacat: caacgtgggg ctagaccagg 1260 

agctggctct gcccaaggac tcgtacttga tagactactt cctctttctg aaccgatacc 1320 

ttgaagtggg gcctccagtg tactttgtca ccacctcgggr cttcaacttc tccagcgagg 1380 

caggcatgaa cgccacttgc tctagcgcag gctgtaagag cttctcccta acccagaaaa 14 4 0 

tccagtatgc cagtgaattc cctgaccagt cttacgtggc tattgctgca tcctcctggg 1500 

tagatgactt catcgactgg ctgaccccgt cctcctcctg ctgtcgcctt tatatacgtg 1560 

gcccccataa ggatgagttc tgtccctcaa cggatacttc cttcaactgc ttaaaaaact 1620 

gcatgaaccg cactctgggt cctgtgaggc ccacagcgga acagtttcat aagtacctgc 1680 

cctggttcct gaatgatccg cccaatatca gatgtcccaa agggggtcta gcagcgtata 1740 

gaacgtctgt gaatttgagc tcagatggcc aggttatagc ctcccagttc atggcctacc 1800 

acaagccctt aaggaactca caggacttca cagaagctct ccgggcgtcc cggttgctag 1860 

cagccaacat cacagctgac ctacggaagg tgcctgggac agatccaaac tttgaggtct 1920 

tcccttacac gatctccaac gtgttctacc agcaatacct gacggtcctt cctgagggaa 1980 

tcttcaccct tgctctttgc tttgtgccca cctttgttgt ctgctacctc ctactgggcc 2040 

tggacatgtg ctcagggatc ctcaacctac tctccatcat tatgattctc gtggacacca 2100 

ttggcctcat ggctgtgtgg ggtatcagct ataatgcggt atccctcatc aaccttgtca 2160 

cggcagtggg catgtctgtg gagtttgtgt cccacatcac tcggtccttt gctgtaagca 2220 
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ccaagcctac ccggctggag agggctaaag atgctactgt cttcatgggc agtgcggtgt 2280 

ttgctggagt ggccatgacc aacttcccag gcatcctcat cttgggcttt gcccaagccc 2340 

agcttattca gatcttcttc ttccgcctca accttctgat caccttgctg ggtctgctgc 2400 

atggcctggt cttcctgccg gttgtcctca gctatctggg accagatgtt aaccaagctc 24 60 

tggtacagga ggagaaacta gccagcgagg cagcagtggc cccagagcct tcttgcccac 2520 

agtacccctc ccctgctgat gcggatgcca atgttaacta cggctttgcc ccagaacttg 2580 

cccacggagc taatgctgct agaagctctt tgcccaaaag tgaccaaaag ttctaatgga 2640 

gtaggagctt gtccatgctt cttgctgatg agggatcatg aaggtcttcc ctctggttgt 2700 

cctcaaggcc tggggggagg ttgtttcaga gaaaaatggc tggcattcct gccacgaggc 2760 

aaccggcagc attggcactg acctccttgc tctcataggt ccctaaggcc ttggtcagat 2820 

tacctcctcc atggagagac tatcttaagt atcttaagta tcgtatggga tgcatcgcct 2880 

gtcaattaaa aaggctatgg cctatggctc aggcagggcc atccggaaga agagaggatt 2940 

ctgggataaa gccaggtggg agattcgcct ggggaaaatg tgacaatggt tcctgagcat 3000 

gggcaatcag ccatgtggca gaatgtaaat taatataaat gggttgtctt aagttatgat 3060 

tctagctggg gaggagccta gctgtgtagc caagatattt gtaaatataa aaaaaaaaaa 3120 

aaaa - 3124 

<210> 9 
<211> 4484 
<212> DMA 
<213> Rattus sp. 



<4 00> 9 

atggcagctg cctggctggg atggctgctc tgggccctgc tcctgagcgc ggcccagggt 60 

gagctataca cacccaaaca cgaagctggg gtctgcacct tttacgaaga gtgcgggaaa 120 

aacccagagc tctctggagg cctcacgtca ctatccaatg tatcctgcct gtctaacacc 180 

ccggcccgcc acgtcacggg tgaacacctg gctcttctcc agcgcatctg tccccgcctg 24 0 

tacaacggcc ccaataccac ttttgcctgt tgctctacca agcagctgct gtccttagaa 300 

agcagcatgt ccatcaccaa ggcccttctc acgcgctgcc cggcctgctc tgacaatttt 360 

gtgagcttac actgccacaa cacttgcagc cctgaccaga gcctcttcat caacgtcacc 420 
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cgggtggttg agcggggcgc tggagagcct cctgccgtgg tggcctatga ggccttttat 480 

cagcgcagct ttgctgagaa ggcctatgag tcctgcagcc aggtgcgcat ccctgcggcc 540 

gcttccttgg ccgtgggcag catgtgtgga gtgtatggct ccgccctctg caatgctcag 600 

cgctggctca acttccaagg agacacaggg aatggcctgg ctccgctgga tatcaccttc 660 

cacctcttgg agcctggcca ggccctaccg gatgggatcc agccactgaa tgggaagatc 720 

gcaccctgca acgagtctca gggtgatgac tcagcagtct gctcctgcca ggactgtgcg- 780 

gcgtcctgcc ctgtcatccc tccgcccgag gccttgcgcc cttccttcta catgggtcgc 840 

atgccaggct ggctggccct catcatcatc ttcactgctg tctttgtgtt <gctctctgca 900 

gtccttgtgc gtctccgagt ggtttccaac aggaacaaga acaaggcaga aggcccccag 960 

gaagccccca aactccctca taagcacaaa ctctcacccc ataccatcct crggccggttc 1020 

ttccagaact ggggcacaag ggtggcctcg tggccactca ccgtcttagc actgtccttc 1080 

atcgttgtga tagccttagc agcaggcctg acctttattg aactcaccac agaccctgtg 1140 

.gaactgtggt cggcccccaa gagccaggcc cggaaagaga agtctttcca tgatgagcat 1200 

ttcggcccct tctttcgaac caaccagatt ttcgtgacag ctcggaacag gtccagctac 1260 

aagtacgact ccctactgct agggtccaag aacttcagtg ggatcctgtc cctggacttc 1320 

ctgctggagc tgctggagct tcaggagagg cttcgacacc tgcaagtgtg gtcccctgag 1380 

gcagagcgca acatctccct ccaggacatc tgctatgccc ccctcaaccc atataacacc 1440 

agcctctccg actgctgtgt caacagcctc cttcagtact tccagaacaa ccgcaccctc 1500 

ctgatgctca cggccaacca gactctgaat ggccagacct ccctggtgga ctggaaggac 1560 

catttcctct actgtgcaaa tgcccctctc acgttcaaag atggcacgtc tctggccctg 1620 

agctgcatgg ctgactacgg ggctcctgtc ttccccttcc ttgctgttgg gggataccaa 1680 

ggcacggact attccgaggc agaagcgctg atcataacct tctctctcaa taactacccc 174 0 

gctgatgatc cccgcatggc ccaggccaag ctctgggagg aggctttctt gaaggaaatg 1800 

gaatccttcc agaggaacac aagtgacaag ttccaggttg cgttctcagc tgagcgctct 1860 

ctggaggatg agatcaaccg caccaccatc caggacctgc ctgtctttgc cgtcagctac 1920 

attatcgtct tcctgtacat ctccctggcc ctgggcagct actccagatg cagccgagta 1980 

gcggtggagt ccaaggctac tctgggccta ggtggggtga ttgttgtgct gggagcagtt 2040 

ctggctgcca tgggcttcta ctcctacctg ggtgtcccct cttctctggt tatcatccaa 2100 

gtggtacctt tcctggtgct agctgtggga gctgacaaca tcttcatctt tgttcttgag 2160 

taccagaggc tacctaggat gcctggggaa cagcgagagg ctcacattgg ccgcaccctg 2220 
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ggcagtgtgg cccccagcat gctgctgtgc agcctctctg aggccatctg cttctttcta 2280 

ggggccctga cccccatgcc agctgtgagg accttcgcct tgacctctgg cttagcaatt 2340 

atcctcgact tcctgctcca gatgactgcc tttgtggccc tgctctccct ggatagcaag 2400 

aggcaggagg cctctcgccc ggatgtctta tgctgctttt caacccggaa gctgccccca 24 60 

cctaaagaaa aagaaggcct cttactccgc ttcttccgca agatatacgc tcctttcctg 2520 

ctgcacagat tcatccgccc tgttgtgatg ctgctgtttc tgaccctgtt tggagcaaat 2580 

ctctacttaa tgtgcaacat caacgtgggg ctagaccagg agctggctct gcccaaggac 2640 

tcgtacttga tagactactt cctctttctg aaccgatacc ttgaagtggg gcctccagtg 2700 

tactttgtca ccacctcggg cttcaacttc tccagcgagg caggcatgaa cgccacttgc 2760 

tctagcgcag gctgtaagag cttctcccta acccagaaaa tccagtatgc cagtgaattc 2820 

cctgaccagt cttafcgtggc tattgctgca tcctcctggg tagatgactt catcgactgg 2880 

ctgaccccgt cctcctcctg ctgtcgcctt tatatacgtg gcccccataa ggatgagttc 2940 

tgtccctcaa cggatacttc cttcaactgc ttaaaaaact gcatgaaccg cactctgggt 3000 

cctgtgaggc ccacagcgga aca^tttcat aagtacctgc cctggttcct gaatgatccg 3060 

cccaatatca gatgtcccaa agggggtcta gcagcgtata gaacgtctgt gaatttgagc 3120 

tcagatggcc aggttatagc ctcccagttc atggcctacc acaagccctt aaggaactca 3180 

caggacttca cagaagctct ccgggcgt cc cggttgctag cagccaacat cacagctgac 3240 

ctacggaagg tgcctgggac agatccaaac tttgaggtct tcccttacac gatctccaac 3300 

gtgttctacc agcaatacct gacggtcctt cctgagggaa tcttcaccct tgctctttgc 3360 

tttgtgccca cctttgttgt ctgctacctc ctactgggcc tggacatgtg ctcagggatc 3420 

ctcaacctac tctccatcat tatgattctc gtggacacca ttggcctcat ggctgtgtgg 3480 

ggtatcagct ataatgcggt atccctcatc aaccttgtca cggcagtggg catgtctgtg 3540 

gagtttgtgt cccacatcac tcggtccttt gctgtaagca ccaagcctac ccggctggag 3600 

agggctaaag atgctactgt cttcatgggc agtgcggtgt ttgctggagt ggccatgacc 3660 

aacttcccag gcatcctcat cttgggcttt gcccaagccc agcttattca gatcttcttc 3720 

ttccgcctca accttctgat caccttgctg ggtctgctgc atggcctggt cttcctgccg 3780 

gttgtcctca gctatctggg accagatgtt aaccaagctc tggtacagga ggagaaacta 384 0 

gccagcgagg cagcagtggc cccagagcct tcttgcccac agtacccctc ccctgctgat 3900 

gcggatgcca atgttaacta cggctttgcc ccagaacttg cccacggagc taatgctgct 3960 
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agaagctctt tgcccaaaag tgaccaaaag ttctaatgga gtaggagctt gtccatgctt 4020 

cttgctgatg agggatcatg aaggtottcc ctctggttgt cctcaaggcc tggggggagg 4 080 

ttgtttcaga gaaaaatggc tggca-fctcct gccacgaggc aaccggcagc attggcactg 4140 

acctccttgc tctcataggt ccctasggcc ttggtcagat tacctcctcc atggagagac 4 200 

tatcttaagt atcttaagta tcgta-fcggga tgcatcgcct gtcaattaaa aaggctatgg 4 260 

cctatggctc aggcagggcc atccggaaga agagaggatt ctgggataaa gccaggtggg 4 320 

agattcgcct ggggaaaatg tgacaatggt tcctgagcat gggcaatcag ccatgtggca 4 380 

gaatgtaaat taatataaat gggttgtctt aagttatgat tctagctggg gaggagccta 4 440 

gctgtgtagc caagatattt gtaaatataa aaaaaaaaaa aaaa 4 484 

<210> 10 

<211> 3993 
<212> DNA 

<213> Rattus sp. 
<220> 

<221> misc — feature 

<222> (1) . . (3993) 

<223> n is g or a or t or c: 

<400> 10 

atggcngcng cntggytngg ntggyfcnytn tgggcnytny tnytnwsngc ngcncarggn 60 

garytntaya cnccnaarca ygargcnggn gtntgyacnt tytaygarga rtgyggnaar 120 

aayccngary tnwsnggngg nytnacnwsn ytnwsnaayg tnwsntgyyt nwsnaayacn 180 

ccngcnmgnc aygtnacngg ngarcayytn gcnytnytnc armgnathtg yccnmgnytn 240 

tayaayggnc cnaayacnac nttygcntgy tgywsnacna arcarytnyt nwsnytngar 300 

wsnwsnatgw snathacnaa rgcnytnytn acnmgntgyc cngcntgyws ngayaaytty 360 

gtnwsnytnc aytgycayaa yacntg-ywsn ccngaycarw snytnttyat haaygtnacn 420 

ingngtngtng armgnggngc nggngaxccn ccngcngtng tngcntayga rgcnttytay 480 

carmgnwsnt tygcngaraa rgcntaygar wsntgywsnc argtnmgnat hccngcngcn 540 

gcnwsnytng cngtnggnws natgtgyggn gtntayggnw sngcnytntg yaaygcncar 600 

mgntggytna ayttycargg ngayacnggn aayggnytng cnccnytnga yathacntty 660 

cayytnytng arccnggnca rgcnytnccn gayggnathc arccnytnaa yggnaarath 720 
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gcnccntgya aygarwsnca rggngaygay wsngcngtnt gywsntgyca rgaytgygcn 780 

gcnwsntgyc cngtnathcc nccnccngar gcnytnmgnc cnwsnttyta yatgggnmgn 840 

atgccnggnt ggytngcnyt nathathath ttyacngcng tnttygtnyt nytnwsngcn 90O 

gtnytngtnm gnytnmgngt ngtnwsnaay mgnaayaara ayaargcnga rggnccncar 960 

gargcnccna arytnccnca yaarcayaar ytnwsnccnc ayacnathyt nggnmgntty 1020 

ttycaraayt ggggnacnmg ngtngcnwsn tggccnytna cngtnytngc nytnwsntty 1080 

athgtngtna thgcnytngc ngcnggnytn acnttyathg arytnacnac ngayccngtn 1140 

garytntggw sngcnccnaa rwsncargcn mgnaargara arwsnttyca ygaygarcay 1200 

ttyggnccnt tyttymgnac naaycarath ttygtnacng cnmgnaaymg nwsnwsntay 1260 

aartaygayw snytnytnyt nggnwsnaar aayttywsng gnathytnws nytngaytty 1320 

ytnytngary tnytngaryt ncargarmgn ytnmgncayy tncargtntg gwsnccngar 1380 

gcngarmgna ayathwsnyt ncargayath tgytaygcnc cnytnaaycc ntayaayacn 14 40 

wsnytnwsng aytgytgygt naaywsnytn ytncartayt tycaraayaa ymgnacnytn 1500 

ytnatgytna cngcnaayca racnytnaay ggncaracnw snytngtnga ytggaargay 1560 

cayttyytnt aytgygcnaa ygcnccnytn acnttyaarg ayggnacnws nytngcnytn 1620 

wsntgyatgg cngaytaygg ngcnccngtn ttyccnttyy tngcngtngg nggntaycar 1680 

ggnacngayt aywsngargc ngargcnytn athathacnt tywsnytnaa yaaytayccn 1740 

gcngaygayc cnmgnatggc ncargcnaar ytntgggarg argent tyyt naargaratg 1800 

garwsnttyc armgnaayac nwsngayaar ttycargtng cnttywsngc ngarmgnwsn 1860 

ytngargayg arathaaymg nacnacnath cargayytnc cngtnttygc ngtnwsntay 1920 

athathgtnt tyytntayat hwsnytngen ytnggnwsnt aywsnmgntg ywsnmgngtn 1980 

gcngtngarw snaargenac nytnggnytn ggnggngtna thgtngtnyt nggngcngtn 2040 

ytngengena tgggnttyta ywsntayytn ggngtnccnw snwsnytngt nathathcar 2100 

gtngtnccnt tyytngtnyt ngcngtnggn gengayaaya th-ttyathtt ygtnytngar 2160 

taycarmgny tnccnmgnat gecnggngar carmgngarg encayathgg nmgnacnytn 2220 

ggnwsngtng encenwsnat gytnytntgy wsnytnwsng argenathtg yttyttyytn 2280 

ggngenytna cnccnatgcc ngcngtnmgn acnttygeny tnacnwsngg nytngenath 2340 

athytngayt tyytnytnea ratgacngen ttygtngcny tnytnwsnyt ngaywsnaar 2400 

mgncargarg cnwsnmgncc ngaygtnytn tgytgyttyw snaenmgnaa rytnccnccn 24 60 
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ccnaargara argarggnyt nytnytnmgn ttyttymgna arathtaygc nccnttyytn 2520 

ytncaymgnt tyathragncc ngtngtnatg ytnytnttyy tnacnytntt yggngcnaay* 2580 

ytntayytna tgtgyaayat haaygtnggn ytngaycarg arytngcnyt nccnaargay 2640 

wsntayytna thgaytaytt yytnttyytn aaymgntayy tngargtngg nccnccngtn 2700 

tayttygtna cnacnwsngg nttyaaytty wsnwsngarg cnggnatgaa ygcnacntgy 2760 

wsnwsngcng gntgyaarws nttywsnytn acncaraara thcartaygc nwsngartty 2820 

ccngaycarw sntaygtngc nathgcngcn wsnwsntggg tngaygaytt yathgaytgg 2880 

ytnacnccnw snwsnwsntg ytgymgnytn tayathmgng gnccncayaa rgaygartty 2940 

tgyccnwsna cngayacnws nttyaaytgy ytnaaraayt gyatgaaymg nacnytnggn 3000 

ccngtnmgnc cnacngcnga rcarttycay aartayytnc cntggttyyt naaygayccn 3060 

ccnaayathm gntgyccnaa rggnggnytn gcngcntaym gnacnwsngt naayytnwsn 3120 

wsngayggnc argtnathgc nwsncartty atggcntayc ayaarccnyt nmgnaaywsn 3180 

cargayttya cngargcnyt nmgngcnwsn mgnytnytng cngcnaayat hacngcngay 3240 

ytnmgnaarg tnccnggnac ngayccnaay ttygargtnt tyccntayac nathwsnaay 3300 

gtnttytayc arcartayyt nacngtnytn ccngarggna thttyacnyt ngcnytntgy 3360 

ttygtnccna cnttygtngt ntgytayytn ytnytnggny tngayatgtg ywsnggnath 3420 

ytnaayytny tnwsnathat hatgathytn gtngayacna thggnytnat ggcngtntgg 3480 

ggnathwsnt ayaaygcngt nwsnytnath aayytngtna cngcngtngg natgwsngtn 3540 

garttygtnw sncayathac nmgnwsntty gcngtnwsna cnaarccnac nmgnytngar 3600 

mgngcnaarg aygcnacngt nttyatgggn wsngcngtnt tygcnggngt ngcnatgacn 3660 

aayttyccng gnathytnat hytnggntty gcncargcnc arytnathca rathttytty 3720 

ttymgnytna ayytnytnat hacnytnytn ggnytnytnc ayggnytngt nttyytnccn 3780 

gtngtnytnw sntayytngg nccngaygtn aaycargcny tngtncarga rgaraarytn 3840 

gcnwsngarg cngcngtngc nccngarccn wsntgyccnc artayccnws nccngcngay 3900 

gcngaygcna aygtnaayta yggnttygcn ccngarytng cncayggngc naaygcngcn 3960 

ragnwsnwsny tnccnaarws ngaycaraar tty 3993 

<210> 11 
<211> 4002 
<212> DNA 
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<213> Mus sp. 



<220> 

<221> CDS 

<222> (1)..(4002) 

<223> 



<400> 11 

atg gca get gec tgg cag gga tgg ctg etc tgg gec ctg etc c=tg aat 48 
Met Ala Ala Ala Trp Gin Gly Trp Leu Leu Trp Ala Leu Leu Leu Asn 
1 5 * 10 JL5 

teg gec cag ggt gag etc tac aca ccc act cac aaa get ggc ttc tgc 96 
Ser Ala Gin Gly Glu Leu Tyr Thr Pro Thr His Lys Ala Gly Phe Cys 

20 25 30 

ace ttt tat gaa gag tgt ggg aag aac cca gag ctt tct gga cjgc etc 144 
Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Sly Leu 
35 40 45 

aca tea eta tec aat ate tec tgc ttg tct aat ace cca gec c=gc cat 192 
Thr Ser Leu Ser Asn lie Ser Cys Leu Ser Asn Thr Pro Ala A*xg His 
50 55 60 

gtc aca ggt gac cac ctg get ctt etc cag cgc gtc tgt ccc c=gc eta 240 
Val Thr Gly Asp His Leu Ala Leu Leu Gin Arg Val Cys Pro A-rg Leu 
65 70 75 80 

tac aat ggc ccc aat gac ace tat gee tgt tgc tct ace aag cag ctg 288 
Tyr Asn Gly Pro Asn Asp Thr Tyr Ala Cys Cys Ser Thr Lys &ln Leu 

85 90 9 5 

gtg tea tta gac agt age ctg tct ate acc aag gec etc ctt a ca cgc 336 
Val Ser Leu Asp Ser Ser Leu Ser lie Thr Lys Ala Leu Leu Tlir Arg 

100 105 ~ 110 

tgc ccg gca tgc tct gaa aat ttt gtg age ata cac tgt cat a at acc 384 
Cys Pro Ala Cys Ser Glu Asn Phe Val Ser lie His Cys His Asn Thr 
115 120 125 

tgc age cct gac cag age etc ttc ate aat gtt act cgc gtg g*tt cag 4 32 

Cys Ser Pro Asp Gin Ser Leu Phe He Asn Val Thr Arg Val V«al Gin 
130 135 140 

egg gac cct gga cag ctt cct get gtg gtg gec tat gag gec t~tt tat 4 80 

Arg Asp Pro Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Plie Tyr 
145 150 155 160 

caa cgc agt ttt gca gag aag gec tat gag tec tgt age egg g"tg cgc 528 
Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Arg Val Arg 

165 170 1-75 
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ate cct gca get gee teg ctg get gtg ggc age atg tgt gga gtg tat 576 
lie Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 

180 185 190 

ggc tct gee etc tgc aat get cag cgc tgg etc aac ttc caa gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

aca ggg aat ggc ctg get ccg ctg gac ate acc ttc cac etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gee ctg gca gat ggg atg aag cca ctg gat ggg aag ate 720 
Pro Gly Gin Ala Leu Ala Asp Gly Met Lys Pro Leu Asp Gly Lys He 
225 230 235 240 

aca ccc tgc aat gag tec cag ggt gaa gac teg gca gee tgt tec tgc 768 
Thr Pro Cys Asn Glu Ser Gin Gly Glu Asp Ser Ala Ala Cys Ser Cys 

245 250 255 

cag gac tgt gca gca tec tgc cct gtc ate cct ccg ccc ccg gee ctg 816 
Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Pro Ala Leu 

260 265 270 

cgc cct tct ttc tac atg ggt cga atg cca ggc tgg ctg get etc ate 864 
Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 

ate ate ttc act get gtc ttt gta ttg etc tct gtt gtc ctt gtg tat 912 
He He Phe Thr Ala Val Phe Val Leu Leu Ser Val Val Leu Val Tyr 
290 295 300 

etc cga gtg get tec aac agg aac aag aac aag aca gca ggc tec cag 960 
Leu Arg Val Ala Ser Asn Arg Asn Lys Asn Lys Thr Ala Gly Ser Gin 
305 310 315 320 

gaa gec ccc aac etc cct cgt aag cgc aga ttc tea cct cac act gtc 1008 
Glu Ala Pro Asn Leu Pro Arg Lys Arg Arg Phe Ser Pro His Thr Val 

325 330 335 

ctt ggc egg ttc ttc gag age tgg gga aca agg gtg gee tea tgg cca 1056 
Leu Gly Arg Phe Phe Glu Ser Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 345 350 

etc act gtc ttg gca ctg tec ttc ata gtt gtg ata gee ttg tea gta 1104 
Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ser Val 
355 360 365 

ggc ctg acc ttt ata gaa etc acc aca gac cct gtg gaa ctg tgg teg 1152 
Gly Leu Thr Phe He Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 

gec cct aaa age caa gec egg aaa gaa aag get ttc cat gac gag cat 1200 
Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ala Phe His Asp Glu His 
385 390 395 400 

ttt ggc ccc ttc ttc cga acc aac cag att ttt gtg aca get aag aac 1248 
Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Lys Asn 

405 410 415 
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agg tec age tac aag tac gac tec ctg ctg eta ggg ccc aag aac ttc 1296 
Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 

agt ggg ate eta tec ctg gac ttg ctg cag gag ctg ttg gag eta cag 1344 
Ser Gly He Leu Ser Leu Asp Leu Leu Gin Glu Leu Leu Glu Leu Gin 
435 440 445 

gag aga ctt cga cac ctg caa gtg tgg tec cat gag gca cag cgc aac 1392 
Glu Arg Leu Arg His Leu Gin Val Trp Ser His Glu Ala Gin Arg Asn 
450 455 460 

ate tec etc cag gac ate tgc tat get ccc etc aac ccg cat aac ace 14 40 

He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro His Asn Thr 
465 470 475 480 

age etc act gac tgc tgt gtc aac age etc ctt caa tac ttc cag aac 14 88 

Ser Leu Thr Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 "4 90 495 

aac cac aca etc ctg ctg etc aca gec aat cag act ctg aat ggc cag 1536 
Asn His Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 

500 505 510 

acc tec ctg gtg gac tgg aag gac cat ttc etc tac tgt gee aat gec 1584 
Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 * 525 

cct etc acg tac aaa gat ggc aca gee ctg gee ctg age tgc ata get 1632 
Pro Leu Thr Tyr Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys He Ala 
530 535 540 

gac tac ggg gca cct gtc ttc ccc ttc ctt get gtt ggg ggc tac caa 1680 
Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 

ggg acg gac tac teg gag gca gaa gec ctg ate ata acc ttc tct ate 1728 
Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu He He Thr Phe Ser He 

565 570 575 

aat aac tac ccc get gat gat ccc cgc atg gec cac gee aag etc tgg 177 6 
Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala His Ala Lys Leu Trp 

580 585 590 

gag gag get ttc ttg aag gaa atg caa tec ttc cag aga age aca get 1824 
Glu Glu Ala Phe Leu Lys Glu Met Gin Ser Phe Gin Arg Ser Thr Ala 
595 600 605 

gac aag ttc cag att gcg ttc tea get gag cgt tct ctg gag gac gag 1872 
Asp Lys Phe Gin He Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 

ate aat cgc act acc ate cag gac ctg cct gtc ttt gee ate age tac 1920 
He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala He Ser Tyr 
625 630 635 640 

ctt ate gtc ttc ctg tac ate tec ctg gee ctg ggc age tac tec aga 1968 
Leu He Val Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Arg 
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64 5 650 655 

tgg age cga gtt gcg gtg gat tec aag get act ctg ggc eta ggt ggg 2016 
Trp Ser Arg Val Ala Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 

gtg get gtt gtg ctg gga gca gtc gtc get gec atg ggc ttc tac tec 2064 
Val Ala Val Val Leu Gly Ala Val Val Ala Ala Met Gly Phe Tyr Ser 
675 680 685 

tac ctg ggt gtc ccc tec tct ctg gtc ate att caa gtg gta cct ttc 2112 
Tyr Leu Gly Val Pro Ser Ser Leu Val lie lie Gin Val Val Pro Phe 
690 " 695 700 

■ 

ctg gtg ctg get gtg gga get gac aac ate ttc ate ttt gtt ctt gag 2160 
Leu Val Leu Ala Val Gly Ala Asp Asn lie Phe lie Phe Val Leu Glu 
705 710 715 720 

tac cag agg ctg cct agg atg ccc ggg gag cag cga gag get cac att 2208 
Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala His lie 

725 730 735 

ggc cgc acc ctg ggt agt gtg gec ccc age atg ctg ctg tgc age etc 2256 
Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 

tct gag gee ate tgc ttc ttt eta ggg gec ctg acc tec atg cca get 2304 
Ser Glu Ala lie Cys Phe Phe Leu Gly Ala Leu Thr Ser Met Pro Ala 
755 760 765 

gtg agg acc ttt gec ttg acc tct ggc tta gca ate ate ttt gac ttc 2352 
Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala lie lie Phe Asp Phe 
770 775 780 

ctg etc cag atg aca gec ttt gtg gec ctg etc tec ctg gat age aag 2400 
Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 

agg cag gag gec tct cgc ccc gac gtc gtg tgc tgc ttt tea age cga 2448 
Arg Gin Glu Ala Ser Arg Pro Asp Val Val Cys Cys Phe Ser Ser Arg 

805 810 815 

aat ctg ccc cca ccg aaa caa aaa gaa ggc etc tta ctt tgc ttc ttc 24 96 
Asn Leu Pro Pro Pro Lys Gin Lys Glu Gly Leu Leu Leu Cys Phe Phe 

820 825 830 



cgc aag ata tac act ccc ttc ctg ctg cac aga ttc ate cgc cct gtt 
Arg Lys lie Tyr Thr Pro Phe Leu Leu His Arg Phe lie Arg Pro Val 
835 840 845 



2544 



gtg ctg ctg etc ttt ctg gtc ctg ttt gga gca aac etc tac tta atg 2592 
Val Leu Leu Leu Phe Leu Val Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

tgc aac ate age gtg ggg ctg gac cag gat ctg get ctg ccc aag gat 2640 
Cys Asn He Ser Val Gly Leu Asp Gin Asp Leu Ala Leu Pro Lys Asp 
865 870 875 880 

tec tac ctg ata gac tac ttc etc ttt ctg aac egg tac ttg gaa gtg 2688 
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Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 

ggg cct cca gtg tac ttt gac acc acc tea ggc tac aac ttt tec ace 
Gly Pro Pro Val Tyr Phe Asp Thr Thr Ser Gly Tyr Asn Phe Ser Thr 

900 905 " 910 



2736 



gag gca ggc atg aac gec att tgc tct agt gca ggc tgt gag age ttc 
Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Glu Ser Phe 
915 920 925 

tec eta acc cag aaa ate cag tat gee agt gaa ttc cct aat cag tct 
Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asn Gin Ser 
930 935 * 940 

tat gtg get att get gca tec tec tgg gta gat gac ttc ate gac tgg 
Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

ctg acc cca tec tec tec tgc tgc cgc att tat acc cgt ggc ccc cat 
Leu Thr Pro Ser Ser Ser Cys Cys Arg He Tyr Thr Arg Gly Pro His 

965 970 975 



2784 



2832 



2880 



2928 



aaa gat gag ttc tgt ccc tea acg gat act tec ttc aac tgt etc aaa 
Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 "* 985 990 



2976 



aac tgc atg aac cgc act ctg ggt ccc gtg aga ccc aca aca gaa cag 
Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Thr Glu Gin 
995 1000 1005 



3024 



ttt cat aag tac ctg ccc tgg ttc ctg aat gat acg ccc aac ate 
Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Thr Pro Asn lie 
1010 1015 1020 



3069 



aga tgt cct aaa ggg ggc eta gca gcg tat aga acc tct gtg aat 

Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 

ttg age tea gat ggc cag att " ata gee tec cag ttc atg gee tac 

Leu Ser Ser Asp Gly Gin He He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



3114 



3159 



cac aag ccc tta egg aac tea cag gac ttt aca gaa get etc egg 
His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



3204 



gca tec egg ttg eta gca gee aac ate aca get gaa eta egg aag 
Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Glu Leu Arg Lys 
1070 1075 1080 



324 9 



gtg cct ggg aca gat ccc aac ttt gag gtc ttc cct tac acg ate 

Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 

tec aat gtg ttc tac cag caa tac ctg acg gtt etc cct gag gga 

Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 



3294 



3339 
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ate ttc act ctt get etc tgc ttc gtg ccc acc ttt gtg gtc tgc 
lie Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 

tac etc eta ctg ggc ctg gac ata cgc tea ggc ate etc aac ctg 
Tyr Leu Leu Leu Gly Leu Asp lie Arg Ser Gly lie Leu Asn Leu 
1130 1135 1140 

etc tec ate att atg ate etc gtg gac acc ate ggc etc atg get 
Leu Ser lie He Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 

gtg tgg ggt ate age tac aat get gtg tec etc ate aac ctt gtc 
Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



3384 



3429 



3474 



3519 



acg gca gtg ggc atg tct gtg gag ttc gtg tec cac att acc egg 
Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



3564 



tec ttt get gta age acc aag cct acc egg ctg gag aga gee aaa 
Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



3609 



gat get act ate ttc atg ggc agt gcg gtg ttt get gga gtg gec 
Asp Ala Thr He Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 



3654 



atg acc aac ttc ccg ggc ate etc ate ctg ggc ttt get cag gee 
Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



3699 



cag ctt ate cag att ttc ttc ttc cgc etc aac etc ctg ate acc 
Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 . 1245 



3744 



ttg ctg ggt ctg eta cac ggc ctg gtc ttc ctg ccc gtt gtc etc 
Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



3789 



age tat ctg ggg cca gat gtt aac caa get ctg gta ctg gag gag 
Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Leu Glu Glu 
1265 1270 1275 



3834 



aaa eta gee act gag gca gec atg gtc tea gag cct tct tgc cca 
Lys Leu Ala Thr Glu Ala Ala Met Val Ser Glu Pro Ser Cys Pro 
1280 1285 1290 



3879 



cag tac ccc ttc ccg get gat gca aac acc agt gac tat gtt aac 

Gin Tyr Pro Phe Pro Ala Asp Ala Asn Thr Ser Asp Tyr Val Asn 

1295 1300 1305 

tac ggc ttt aat cca gaa ttt ate cct gaa att aat get get age 

Tyr Gly Phe Asn Pro Glu Phe He Pro Glu lie Asn Ala Ala Ser 

1310 1315 1320 

age tct ctg ccc aaa agt gac caa aag ttc taa 

Ser Ser Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



3924 



3969 



4002 
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<210> 


12 


<211> 


1333 


<212> 


PRT 


<213> 


Mus 


<400> 


12 



Met Ala Ala Ala Trp Gin Gly Trp Leu Leu Trp Ala Leu Leu Leu Asn 
15 10 15 



Ser Ala Gin Gly Glu Leu Tyr Thr Pro Thr His Lys Ala Gly Phe Cys 

20 25 30 



Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 



Thr Ser Leu Ser Asn lie Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 



Val Thr Gly Asp His Leu Ala Leu Leu Gin Arg Val Cys Pro Arg Leu 
65 70 75 80 



Tyr Asn Gly Pro Asn Asp Thr Tyr Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 



Val Ser Leu Asp Ser Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 

100 105 110 



Cys Pro Ala Cys Ser Glu Asn Phe Val Ser He His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asp Gin Ser Leu Phe He Asn Val Thr Arg Val Val Gin 
130 135 140 



Arg Asp Pro Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Arg Val Arg 

165 170 175 



He Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 
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180 185 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

Pro Gly Gin Ala Leu Ala Asp Gly Met Lys Pro Leu Asp Gly Lys He 
225 230 235 ** 240 

Thr Pro Cys Asn Glu Ser Gin Gly Glu Asp Ser Ala Ala Cys Ser Cys 

245 250 255 

* 

Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Pro Ala Leu 

260 265 270 

Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 

lie He Phe Thr Ala Val Phe Val Leu Leu Ser Val Val Leu Val Tyr 
290 295 300 

Leu Arg Val Ala Ser Asn Arg Asn Lys Asn Lys Thr Ala Gly Ser Gin 
305 310 315 320 

Glu Ala Pro Asn Leu Pro Arg Lys Arg Arg Phe Ser Pro His Thr Val 

325 330 335 

Leu Gly Arg Phe Phe Glu Ser Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 345 350 

Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ser Val 
355 360 365 

Gly Leu Thr Phe He Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 

Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ala Phe His Asp Glu His 
385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Lys Asn 

405 410 415 
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Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 " 425 430 



Ser Gly lie Leu Ser Leu Asp Leu Leu Gin Glu Leu Leu Glu Leu Gin 
435 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser His Glu Ala Gin Arg Asn 
450 455 460 



lie Ser Leu Gin Asp lie Cys Tyr Ala Pro Leu Asn Pro His Asn Thr 
465 470 475 480 



Ser Leu Thr Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 



Asn His Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 

500 505 510 



Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Tyr Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys lie Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 



Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu lie He Thr Phe Ser lie 

565 570 575 



Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala His Ala Lys Leu Trp 

580 585 590 



Glu Glu Ala Phe Leu Lys Glu Met Gin Ser Phe Gin Arg Ser Thr Ala 
595 600 605 



Asp Lys Phe Gin He Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala He Ser Tyr 
625 630 635 640 



Leu He Val Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Arg 

645 650 655 
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Trp Ser Arg Val Ala Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 



Val Ala Val Val Leu Gly Ala Val Val Ala Ala Met Gly Phe Tyr Ser 
675 680 685 



Tyr Leu Gly Val Pro Ser Ser Leu Val lie lie Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ala Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 

Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala His lie 

725 730 735 

Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Ser Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala He He Phe Asp Phe 
770 775 780 



Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 " 800 

Arg Gin Glu Ala Ser Arg Pro Asp Val Val Cys Cys Phe Ser Ser Arg 

805 810 815 



Asn Leu Pro Pro Pro Lys Gin Lys Glu Gly Leu Leu Leu Cys Phe Phe 

820 825 830 

Arg Lys He Tyr Thr Pro Phe Leu Leu His Arg Phe He Arg Pro Val 
835 840 845 

Val Leu Leu Leu Phe Leu Val Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

Cys Asn He Ser Val Gly Leu Asp Gin Asp Leu Ala Leu Pro Lys Asp 
865 870 875 880 



Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 
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Gly Pro Pro Val Tyr Phe Asp Thr Thr Ser Gly Tyr Asn Phe Ser Thr 

900 905 910 



Glu Ala Gly Met Asn Ala lie Cys Ser Ser Ala Gly Cys Glu Ser Phe 
915 920 925 



Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asn Gin Ser 
930 935 940 



Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe lie Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Ser Cys Cys Arg He Tyr Thr Arg Gly Pro His 

965 970 975 



Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 985 990 



Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Thr Glu Gin 
995 1000 1005 



Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Thr Pro Asn lie 
1010 1015 1020 



Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 



Leu Ser Ser Asp Gly Gin lie He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Glu Leu Arg Lys 
1070 1075 1080 



Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 



Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 

He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
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1115 1120 1125 



Tyr Leu Leu Leu Gly Leu Asp lie Arg Ser Gly lie Leu Asn Leu 
1130 1135 " 1140 



Leu Ser lie lie Met lie Leu Val Asp Thr lie Gly Leu Met Ala 
1145 1150 1155 



Val Trp Gly lie Ser Tyr Asn Ala Val Ser Leu lie Asn Leu Val 
1160 1165 1170 



Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His lie Thr Arg 
1175 1180 1185 



Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 ~~ 1200 



Asp Ala Thr lie Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 



Met Thr Asn Phe Pro Gly lie Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 

Gin Leu He Gin lie Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 

Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 

Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Leu Glu Glu 
1265 1270 1275 

Lys Leu Ala Thr Glu Ala Ala Met Val Ser Glu Pro Ser Cys Pro 
1280 1285 1290 

Gin Tyr Pro Phe Pro Ala Asp Ala Asn Thr Ser Asp Tyr Val Asn 
1295 1300 1305 



Tyr Gly Phe Asn Pro Glu Phe lie Pro Glu He Asn Ala Ala Ser 
1310 1315 1320 

Ser Ser Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 
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<210> 13 
<211> 3999 
<212> DNA 
<213> Mus sp. 
<220> 

<221> misc_ feature 

<222> (1) . . (3999) 

<223> n is g or a or t or c 

<400> 13 

atggcngcng cntggcargg ntggytnytn tgggcnytny tnytnaayws ngcncarggn 60 

garytntaya cnccnacnca yaargcnggn ttytgyacnt tytaygarga rtgyggnaar 120 

aayccngary tnwsnggngg nytnacnwsn ytnwsnaaya thwsntgyyt nwsnaayacn 180 

ccngcnmgnc aygtnacngg ngaycayytn gcnytnytnc armgngtntg yccnmgnytn 240 

tayaayggnc cnaaygayac ntaygcntgy tgywsnacna arcarytngt nwsnytngay 300 

wsnwsnytnw snathacnaa rgcnytnytn acnntgntgyc cngcntgyws ngaraaytty 360 

gtnwsnathc aytgycayaa yacntgywsn ccngaycarw snytnttyat haaygtnacn 420 

mgngtngtnc armgngaycc nggncarytn ccngcngtng tngcntayga rgcnttytay 480 

carmgnwsnt tygcngaraa rgcntaygar wsntgywsnra gngtnrognat hccngcngcn 540 

gcnwsnytng cngtnggnws natgtgyggn gtntayggnw sngcnytntg yaaygcncar 600 

mgntggytna ayttycargg ngayacnggn aayggnytng cnccnytnga yathacntty 660 

cayytnytng arccnggnca rgcnytngcn gayggnatga arccnytnga yggnaarath 720 

acnccntgya aygarwsnca rggngargay wsngcngcnt gywsntgyca rgaytgygcn 780 

gcnwsntgyc cngtnathcc nccnccnccn gcnytnmgnc cnwsnttyta yatgggnmgn 84 0 

atgccnggnt ggytngcnyt nathathath ttyacngcng tnttygtnyt nytnwsngtn 900 

gtnytngtnt ayytnmgngt ngcnwsnaay mgnaayaara ayaaracngc nggnwsncar 960 

gargcnccna ayytnccnmg naarmgnmgn ttywsnccnc ayacngtnyt nggnmgntty 1020 

ttygarwsnt ggggnacnmg ngtngcnwsn tggccnytna cngtnytngc nytnwsntty 1080 

athgtngtna thgcnytnws ngtnggnytn acnttyathg arytnacnac ngayccngtn 1140 

garytntggw sngcnccnaa rwsncargcn mgnaargara argent t yea ygaygarcay 1200 

ttyggnccnt tyttymgnac naaycarath ttygtnacng cnaaraaymg nwsnwsntay 1260 
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aartaygayw snytnytnyt nggnccnaar aayttywsng gnathytnws nytngayytn 1320 

ytncargary tnytngaryt ncargarmgn ytnmgncayy tncargtntg gwsncaygar 1380 

gcncarmgna ayathwsnyt ncargayath tgytaygcnc cnytnaaycc ncayaayacn 1440 

wsnytnacng aytgytgygt naaywsnytn ytncartayt tycaraayaa ycayacnytn 1500 

ytnytnytna cngcnaayca racnytnaay ggncaracnw snytngtnga ytggaargay 1560 

cayttyytnt aytgygcnaa ygcnccnytn acntayaarg ayggnacngc nytngcnytn 1620 

wsntgyathg cngaytaygg ngcnccngtn ttyccnttyy tngcngtngg nggntaycar 1680 

ggnacngayt aywsngargc ngargcnytn athathacnt tywsnathaa yaaytayccn 1740 

gcngaygayc cnmgnatggc ncaygcnaar ytntgggarg argcnttyyt naargaratg 1800 

carwsnttyc arragnwsnac ngcngayaar ttycarathg cnttywsngc ngarmgnwsn 1860 

ytngargayg arathaaymg nacnacnath cargayytnc cngtnttygc nathwsntay 1920 

ytnathgtnt tyytntayat hwsnytngcn ytnggnwsnt aywsnmgntg gwsningngtn 1980 

gcngtngayw snaargcnac nytnggnytn ggnggngtng cngtngtnyt nggngcngtn 2040 

gtngcngcna tgggnttyta ywsntayytn ggngtnccnw snwsnytngt nathathcar 2100 

gtngtnccnt tyytngtnyt ngcngtnggn gcngayaaya thttyathtt ygtnytngar 2160 

taycarmgny tnccnmgnat gccnggngar carmgngarg cncayathgg nmgnacnytn 2220 

ggnwsngtng cnccnwsnat gytnytntgy wsnytnwsng argcnathtg yttyttyytn 2280 

ggngcnytna cnwsnatgcc ngcngtnmgn acnttygcny tnacnwsngg nytngcnath 2340 

athttygayt tyytnytnca ratgacngcn ttygtngcny tnytnwsnyt ngaywsnaar 2400 

mgncargarg cnwsnmgncc ngaygtngtn tgytgyttyw snwsnmgnaa yytnccnccn 24 60 

ccnaarcara argarggnyt nytnytntgy ttyttymgna arathtayac nccnttyytn 2520 

ytncaymgnt tyathmgncc ngtngtnytn ytnytnttyy tngtnytntt yggngcnaay 2580 

ytntayytna tgtgyaayat hwsngtnggn ytngaycarg ayytngcnyt nccnaargay 2640 

wsntayytna thgaytaytt yytnttyytn aaymgntayy tngargtngg nccnccngtn 2700 

tayttygaya cnacnwsngg ntayaaytty wsnacngarg cnggnatgaa ygcnathtgy 2760 

wsnwsngcng gntgygarws nttywsnytn acncaraara thcartaygc nwsrvgartty 2820 

ccnaaycarw sntaygtngc nathgcngcn wsnwsntggg tngaygaytt yathgaytgg 2880 

ytnacnccnw snwsnwsntg ytgymgnath tayacnrogng gnccncayaa rgaygartty 294 0 

tgyccnwsna cngayacnws nttyaaytgy ytnaaraayt gyatgaaymg nacnytnggn 3000 
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ccngtnmgnc cnacnacnga rcarttycay aartayytnc cntggttyyt naaygayacn 3060 

ccnaayathm gntgyccnaa rggnggnytn gcngcntaym gnacnwsngt naayytnwsn 3120 

wsngayggnc ara-thathgc nwsncartty atggcntayc ayaarccnyt nmgnaaywsn 3180 

cargayttya cngargcnyt nmgngcnwsn mgnytnytng cngcnaayat hacngcngar 324 0 

ytnmgnaarg tnccnggnac ngayccnaay ttygargtnt tyccntayac nathwsnaay 3300 

gtnttytayc arcartayyt nacngtnytn ccngarggna thttyacnyt ngcnytntgy 3360 

ttygtnccna cnttygtngt ntgytayytn ytnytnggny tngayathmg nwsnggnath 3420 

ytnaayytny tnwsnathat hatgathytn gtngayacna thggnytnat ggcngtntgg 3480 

ggnathwsnt ayaaygcngt nwsnytnath aayytngtna cngcngtngg natgwsngtn 3540 

garttygtnw sncayathac nmgnwsntty gcngtnwsna cnaarccnac nmgnytngar 3600 

mgngcnaarg aygcnacnat httyatgggn wsngcngtnt tygcnggngt ngcnatgacn 3660 

aayttyccng gna-fchytnat hytnggntty gcncargcnc arytnathca rathttytty 3720 

ttymgnytna ayy-bnytnat hacnytnytn ggnytnytnc ayggnytngt nttyytnccn 3780 

gtngtnytnw sntayytngg nccngaygtn aaycargcny tngtnytnga rgaraarytn 3840 

gcnacngarg cngcnatggt nwsngarccn wsntgyccnc artayccntt yccngcngay 3900 

gcnaayacnw sngaytaygt naaytayggn ttyaayccng arttyathcc ngarathaay 3960 

gcngcnwsnw snwsnytncc naarwsngay caraartty 3999 

<210> 14 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 14 

tcttcaccct tgctctttgc 20 

<210> 15 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer 

<400> 15 

aatgatggag agtaggttga ggat 24 

<210> 16 

<211> 26 

<212> DRA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 16 

tgcccacctt tgttgtctgc taccta 26 

<210> 17 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 17 

atcgctgaca ggatgcagaa g 21 

<210> 18 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> primer 

<400> 18 

tcaggaggag caatgatctt ga 22 

<210> 19 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 19 

agattactgc cctggctcct agcaccatta 30 

<210> 20 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 20 

atcctcatcc tgggctttgc 20 

<210> 21 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 



<400> 21 

gcaaggtgat caggaggttg a 



21 



WO 2005/069900 PCT/US2005/001469 

52/78 



<210> 22 

<211> 29 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 22 

cccagcttat ccagattttc ttcttccgc 29 

<210> 23 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 23 

tcttcaccct tgctctttgc 20 

<210> 24 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 24 

aatgatggag agtaggttga ggat 24 
<210> 25 



<211> 24 
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<212> DNA 



<213> Artificial Sequence 



<220> 

<223> primer 
<400> 25 

tgcccacctt tgttgtctgc tacc 



24 



<210> 26 



<211> 23 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> primer 
<400> 26 

agcacctgtc cactgaagat ttc 



23 



<210> 27 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 27 

tggacgctga gcttcagttc t 



21 



<210> 28 
<211> 24 



<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> primer 

<400> 28 

cttctctgcg ctgcctcgat ggaa 24 

<210> 29 

<211> 21 

<212> DNA . 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 29 

agtaaaaagg gctcgcagga t 21 

<210> 30 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 30 

ggcagctggt gacatcagag a 21 

<210> 31 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
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<400> 31 

aggaggccat gcaggcctac tctga 



25 



<210> 



32 



<211> 



21 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<40O> 32 

gagtccacgg tcagtccatg t 21 

<210> 33 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 33 

ttatgaacaa caatgccaag caa 23 

<210> 34 

<211> 34 

<212> DNA 



<213> 



Artificial Sequence 



<220> 



<223> 



primer 



<400> 34 

agtccttagg tagtggctta gtccctggaa gctc 



34 
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<210> 35 

<211> 52 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 

<400> 35 

gtaatacgac tcactatagg gccctgaccjg tccttcctga gggaatcttc ac 52 

<210> 36 

<211> 50 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 

<400> 36 

gtaatacgac tcactatagg gcctgggaag ttggtcatgg ccactccagc 50 

<210> 37 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

» 

<223> FLAG tag 
<400> 3.7 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

♦ 

<210> 38 
<211> 4 



WO 2005/069900 PCT/US2005/001469 

57/78 



<212> PRT 

<213> Artificial Sequence 



<220> 

<223> motif 

<400> 38 

Tyr Gin Arg Leu 
1 

<210> 39 

<211> 19 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 
<400> 39 

Glu Gin Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn 
1 5 10 15 

lie Arg Cys 



<210> 40 

<211> 16 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> antigen 

<400> 40 

Glu Ala Phe Tyr Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys 
1 5 10 15 
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<210> 41 

<211> 15 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 

<400> 41 

Gly Gin Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys 
1 5 - 10 15 

<210> 42 

<211> 17 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 

<400> 42 

Cys Ala Asn Ala Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu 
1 5 10 15 

Ser 

<210> 43 

<211> 5092 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (57) . . (4136) 
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<223> 



<400> 43 

cttggctgtt cctgaggcct ggcctggctc cccgctgacc ccttcccaga cctggg atg 59 

Met 
1 

gcg gag gcc ggc ctg agg ggc tgg ctg ctg tgg gcc ctg etc ctg cgc 107 
Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu Arg 

5 10 15 

ttg gcc cag agt gag cct tac aca acc ate cac cag cct ggc tac tgc 155 
Leu Ala Gin Sear Glu Pro Tyr Thr Thr He His Gin Pro Gly Tyr Cys 
20 25 30 

gcc ttc tat gac gaa tgt ggg aag aac cca gag ctg tct gga age etc 203 
Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser Leu 
35 40 45 

atg aca etc tec aac gtg tec tgc ctg tec aac acg ccg gcc cgc aag 251 
Met Thr Leu Ser: Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg Lys 
50 55 60 65 

ate aca ggt gat cac ctg ate eta tta cag aag ate tgc ccc cgc etc 299 
He Thr Gly Asp His Leu He Leu Leu Gin Lys He Cys Pro Arg Leu 

70 75 ~ 80 

tac acc ggc ccc aac acc caa gcc tgc tgc tec gcc aag cag ctg gta 347 
Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu Val 

85 90 95 

tea ctg gaa gcg agt ctg teg ate acc aag gcc etc etc acc cgc tgc 395 
Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg Cys 
100 105 110 

cca gcc tgc tct gac aat ttt gtg aac ctg cac tgc cac aac acg tgc 443 
Pro Ala Cys Ser- Asp Asn Phe Val Asn Leu His Cys His Asn Thr Cys 
115 120 125 

age ccc aat cag age etc ttc ate aat gtg acc cgc gtg gcc cag eta 491 
Ser Pro Asn Gin Ser Leu Phe He Asn Val Thr Arg Val Ala Gin Leu 
130 135 140 145 

ggg get gga caa etc cca get gtg gtg gcc tat gag gcc ttc tac cag 539 
Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr Gin 

150 155 160 

cat age ttt gcc gag cag age tat gac tec tgc age cgt gtg cgc gtc 587 
His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg Val 

165 170 175 

cct gca get gcc acg ctg get gtg ggc acc atg tgt ggc gtg tat ggc 635 
Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr Glv 
180 185 190 

tct gcc ctt tgc aat gcc cag cgc tgg etc aac ttc cag gga gac aca 683 
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Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp Thr 
195 200 205 

ggc aat ggt ctg gcc cca ctg gac ate acc ttc cac etc ttg gag cct 731 
Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu Pro 
210 215 220 225 

ggc cag gcc gtg ggg agt ggg att cag cct ctg aat gag ggg gtt gca 779 
Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val Ala 

230 235 240 

cgt tgc aat gag tec caa ggt gac gac gtg gcg acc tgc tec tgc caa 827 
Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys Gin 

245 250 255 

gac tgt get gca tec tgt cct gcc ata gcc cgc ccc cag gcc etc gac 875 
Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu Asp 
260 265 270 

tec acc ttc tac ctg ggc cag atg ccg ggc agt ctg gtc etc ate ate 923 
Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu lie He 
275 280 285 

ate etc tgc tct gtc ttc get gtg gtc acc ate ctg ctt gtg gga ttc 971 
He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly Phe 
290 295 300 305 

cgt gtg gcc ccc gcc agg gac aaa age aag atg gtg gac ccc aag aag 1019 
Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys Lys 

310 315 320 

ggc acc age etc tct gac aag etc age ttc tec acc cac acc etc ctt 1067 
Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu Leu 

325 330 335 

ggc cag ttc ttc cag ggc tgg ggc acg tgg gtg get teg tgg cct ctg 1115 
Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro Leu 
340 345 350 

acc ate ttg gtg eta tct gtc ate ccg gtg gtg gcc ttg gca gcg ggc 1163 
Thr lie Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala Gly 
355 360 365 

ctg gtc ttt aca gaa etc act acg gac ccc gtg gag ctg tgg teg gcc 1211 
Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser Ala 
370 375 380 385 

ccc aac age caa gcc egg agt gag aaa get ttc cat gac cag cat ttc 1259 
Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His Phe 

390 395 400 

ggc ccc ttc ttc cga acc aac cag gtg ate ctg acg get cct aac egg 1307 
Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn Arg 

405 410 415 

tec age tac agg tat gac tct ctg ctg ctg ggg ccc aag aac ttc age 1355 
Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe Ser 
420 425 430 
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gga ate ctg gac ctg gac ttg ctg ctg gag ctg eta gag ctg cag gag 1403 
Gly He Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin Glu 
435 440 445 

agg ctg egg cac etc cag gta tgg teg ccc gaa gca cag cgc aac ate 1451 
Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn lie 
450 455 460 465 

tec ctg cag gac ate tgc tac gec ccc etc aat ccg gac aat acc agt 14 99 

Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr Ser 

470 475 480 

etc tac gac tgc tgc ate aac age etc ctg cag tat ttc cag aac aac 1547 
Leu Tyr- Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn Asn 

485 490 495 

cgc acc/ etc ctg ctg etc aca gee aac cag aca ctg atg ggg cag acc 1595 
Arg Thr- Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin Thr 
500 505 510 

tec caa gtc gac tgg aag gac cat ttt ctg tac tgt gee aat gec ccg 1643 
Ser Gin Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala Pro 
515 520 525 

etc acc ttc aag gat ggc aca gee ctg gee ctg age tgc atg get gac 1691 
Leu Thar Phe Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys Met Ala Asp 
530 535 540 545 

tac ggg gec cct gtc ttc ccc ttc ctt gec att ggg ggg tac aaa gga 1739 
Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala lie Gly Gly Tyr Lys Gly 

550 555 560 

aag gac tat tct gag gca gag gec ctg ate atg acg ttc tec etc aac 1787 
Lys Asp Tyr Ser Glu Ala Glu Ala Leu He Met Thr Phe Ser Leu Asn 

565 570 575 

aat tac cct gee ggg gac ccc cgt ctg gee cag gec aag ctg tgg gag 1835 
Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp Glu 
580 585 590 . 

gag gec ttc tta gag gaa atg cga gee ttc cag cgt egg atg get ggc 1883 
Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala Gly 
595 600 605 

atg ttc cag gtc acg ttc atg get gag cgc tct ctg gaa gac gag ate 1931 
Met Phe Gin Val Thr Phe Met Ala Glu Arg Ser Leu Glu Asp Glu He 
610 615 620 625 

aat cgc acc aca get gaa gac ctg ccc ate ttt gee acc age tac att 1979 
Asn Arc/ Thr Thr Ala Glu Asp Leu Pro He Phe Ala Thr Ser Tyr He 

630 635 640 

gtc ata. ttc ctg tac ate tct ctg gee ctg ggc age tat tec age tgg 2027 
Val He Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Ser Trp 

645 650 655 

age cga gtg atg gtg gac tec aag gee acg ctg ggc etc ggc ggg gtg 2075 
Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly Val 
660 665 670 
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gcc gtg gtc ctg gga gca gtc atg get gec atg ggc ttc ttc tec tac 2123 
Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser Tyr 
675 680 685 

ttg ggt ate cgc tec tec ctg gtc ate ctg caa gtg gtt cct ttc ctg 2171 
Leu Gly He Arg Ser Ser Leu Val He Leu Gin Val Val Pro Phe Leu 
690 695 700 705 

gtg ctg tec gtg ggg get gat aac ate ttc ate ttt gtt etc gag tac 2219 
Val Leu Ser Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu Tyr 

710 715 720 

cag agg ctg ccc egg agg cct ggg gag cca cga gag gtc cac att ggg 2267 
Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His He Gly 

725 730 735 

cga gcc eta ggc agg gtg get ccc age atg ctg ttg tgc age etc tct 2315 
Arg Ala Leu Gly Axg Val Ala Pro Ser Met Leu Leu Cys Ser Leu Ser 
740 745 750 

gag gcc ate tgc ttc ttc eta ggg gcc ctg acc ccc atg cca get gtg 2363 
Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala Val 
755 760 765 

egg acc ttt gcc ctg acc tct ggc ctt gca gtg ate ctt gac ttc etc 2411 
Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe Leu 
770 775 780 785 

ctg cag atg tea gcc ttt gtg gcc ctg etc tec ctg gac age aag agg 2459 
Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys Arg 

7 90 795 800 

cag gag gcc tec egg ttg gac gtc tgc tgc tgt gtc aag ccc cag gag 2507 
Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin Glu 

B05 810 * 815 

ctg ccc ccg cct ggc cag gga gag ggg etc ctg ctt ggc ttc ttc caa 2555 
Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe Gin 
820 825 830 

aag get tat gcc ccc ttc ctg ctg cac tgg ate act cga ggt gtt gtg 2603 
Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val Val 
835 840 845 

ctg ctg ctg ttt etc gcc ctg ttc gga gtg age etc tac tec atg tgc 2651 
Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met Cys 
850 855 860 865 

cac ate age gtg gga ctg gac cag gag ctg gcc ctg ccc aag gac teg 2699 
His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp Ser 

870 875 880 

tac ctg ctt gac tat ttc etc ttt ctg aac. cgc tac ttc gag gtg ggg 2747 
Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val Gly 

885 890 895 

gcc ccg gtg tac ttt gtt acc acc ttg ggc tac aac ttc tec age gag 2795 
Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser Glu 
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900 



905 



910 



get ggg atg aat gec ate tgc tec agt gca ggc tgc aac aac ttc tec 
Ala Gly Met Asn Ala lie Cys Ser Ser Ala Gly Cys Asn Asn Phe Ser 
915 920 925 



2843 



ttc acc cag aag ate cag tat gec aca gag ttc cct gag cag tct tac 
Phe Thr Gin Lys lie Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser Tyr 
930 935 940 945 

ctg gec ate cct gec tec tec tgg gtg gat gac ttc att gac tgg ctg 
Leu Ala lie Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp Leu 

950 955 960 



2891 



2939 



acc ccg tec tec tgc tgc cgc ctt tat ata tct ggc ccc aat aag gac 
Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys Asp 

965 970 975 



2987 



aag ttc tgc ccc teg acc gtc aac tct ctg aac tgc eta aag aac tgc 3035 
Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn Cys 
980 985 * 990 

atg age ate acg atg ggc tct gtg agg ccc teg gtg gag cag ttc cat 3083 
Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe His 
995 1000 1005 



aag tat ctt ccc tgg ttc ctg aac gac egg ccc aac ate aaa tgt 
Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn lie Lys Cys 
1010 1015 1020 



3128 



ccc aaa ggc ggc ctg gca gca tac age acc tct gtg aac ttg act 
Pro Lys Gly Gly Leu Ala Ala Tyr- Ser Thr Ser Val Asn Leu Thr 
1025 1030 1035 



3173 



tea gat ggc cag gtt tta gac aca gtt gee att ctg tea ccc agg 

Ser Asp Gly Gin Val Leu Asp Thr Val Ala He Leu Ser Pro Arg 
1040 1045 1050 

ctg gag tac agt ggc aca ate teg get cac tgc aac etc tac etc 

Leu Glu Tyr Ser Gly Thr He Ser Ala His Cys Asn Leu Tyr Leu 
1055 1060 1065 



3218 



3263 



ctg gat tea gec tec agg ttc atg gee tat cac aag ccc ctg aaa 
Leu Asp Ser Ala Ser Arg Phe Met Ala Tyr His Lys Pro Leu Lys 
1070 1075 1080 



3308 



aac tea cag gat tac aca gaa get ctg egg gca get cga gag ctg 
Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala Ala Arg Glu Leu 
1085 1090 1095 



3353 



gca gec aac ate act get gac ctg egg aaa gtg cct gga aca gac 
Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val Pro Gly Thr Asp 
1100 1105 1110 



3398 



ccg get ttt gag gtc ttc ccc tac acg ate acc aat gtg ttt tat 
Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr Asn Val Phe Tyr 
1115 1120 1125 



3443 



gag cag tac ctg acc ate etc cct gag ggg etc ttc atg etc age 



3488 
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Glu Gin Tyr Leu Thr lie Leu Pro Glu Gly Leu Phe Met Leu Ser 
1130 1135 1140 



etc tgc ctt gtg ccc acc ttc get gtc tec tgc etc ctg ctg ggc 

Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys Leu Leu Leu Gly 
1145 1150 1155 

ctg gac ctg cgc tec ggc etc etc aac ctg etc tec att gtc atg 

Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu Ser lie Val Met 
1160 1165 1170 

ate etc gtg gac act gtc ggc ttc atg gee ctg tgg ggc ate agt 

lie Leu Val Asp Thr Val Gly Phe Met Ala Leu Trp Gly lie Ser 
1175 1180 1185 



3533 



3578 



3623 



tac aat get gtg tec etc ate aac ctg gtc teg gcg gtg ggc atg 

Tyr Asn Ala Val Ser Leu lie Asn Leu Val Ser Ala Val Gly Met 
1190 1195 1200 

tct gtg gag ttt gtg tec cac att acc cgc tec ttt gee ate age 

Ser Val Glu Phe Val Ser His lie Thr Arg Ser Phe Ala lie Ser 
1205 1210 1215 



3668 



3713 



acc aag ccc acc tgg ctg gag agg gee aaa gag gee acc ate tct 
Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu Ala Thr He Ser 
1220 1225 1230 

atg gga agt gcg gtg ttt gca ggt gtg gee atg acc aac ctg cct 
Met Gly Ser Ala Val Phe Ala Gly Val Ala Met Thr Asn Leu Pro 
1235 1240 1245 



3758 



3803 



ggc ate ctt gtc ctg ggc etc gee aag gee cag etc att cag ate 
Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin Leu He Gin He 
1250 1255 1260 

ttc ttc ttc cgc etc aac etc ctg ate act ctg ctg ggc ctg ctg 
Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu Leu Gly Leu Leu 
1265 1270 1275 

cat ggc ttg gtc ttc ctg ccc gtc ate etc age tac gtg ggg cct 

His Gly Leu Val Phe Leu Pro Val He Leu Ser Tyr Val Gly Pro 
1280 1285 1290 

gac gtt aac ccg get ctg gca ctg gag cag aag egg get gag gag 

Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys Arg Ala Glu Glu 
1295 1300 1305 

gcg gtg gca gca gtc atg gtg gec tct tgc cca aat cac ccc tec 
Ala Val Ala Ala Val Met Val Ala Ser Cys Pro Asn His Pro Ser 
1310 1315 * 1320 

cga gtc tec aca get gac aac ate tat gtc aac cac age ttt gaa 
Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn His Ser Phe Glu 
1325 1330 1335 

ggt tct ate aaa ggt get ggt gec ate age aac ttc ttg ccc aac 
Gly Ser He Lys Gly Ala Gly Ala He Ser Asn Phe Leu Pro Asn 
1340 1345 1350 



3848 



3893 



3938 



3983 



4028 



4073 



4118 
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aat ggg egg cag ttc tga tacagecaga ggccctgtct aggctctatg 4166 

Asn Gly Arg Gin Phe 

1355 

gccctgaacc aaagggttat ggggatcttc cttgtgactg ccccttgaca cacgccctcc 4226 

tcaaatccta ggggaggeca ttcccatgag actgcctgtc actggaggat ggcctgctct 4286 

tgaggtatcc aggcagcacc actgatggct cctctgctcc catagtgggt ccccagtttc 4346 

caagtcacct aggccttggg cagtgcctcc tcctgggcct gggtctggaa gttggcagga 4 406 

acagacacac tccatgtttg tcccacactc actcactttc ctaggagccc acttctcatc 44 66 

caacttttcc cttctcagtt cctctctcga aagtcttaat tctgtgtcag taagtcttta 4526 

acaegtagea gtgtccctga gaacacagac aatgaccact accctgggtg tgatatcaca 4586 

ggaggecaga gagaggcaaa ggctcaggcc aagagecaac gctgtgggag gccggtcggc 4 646 

agccactccc tccagggcgc acctgeaggt ctgccatcca cggccttttc tggcaagaga 4706 

agggeccagg aaggatgetc teataaggee caggaaggat gctctcataa gcaccttggt 4766 

catggattag cccctcctgg aaaatggtgt tgggtttggt ctccagctcc aatacttatt 4826 

aaggctgttg ctgccagtca aggccaccca ggagtctgaa ggctgggagc tcttggggct 4886 

gggctggtcc tcccatcttc acctcgggcc tggatcccag gcctcaaacc agcccaaccc 4 94 6 

gagcttttgg acagctctcc agaagcatga actgcagtgg agatgaagat cctggctctg 5006 

tgctgtgcac ataggtgttt aataaacatt tgttggcaga aaaaaaaaaa aaaaaaaaaa 5066 

aaaaaaaaaa aaaaaaaaaa aaaaaa 5092 

<210> 44 

<211> 1359 

<212> PRT 

<213> Homo sapiens 



<400> 44 

Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
1 5 10 15 

Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr lie His Gin Pro Gly Tyr 

20 25 30 



Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 
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Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 

Lys lie Thr Gly Asp His Leu He Leu Leu Gin Lys He Cys Pro Arg 
65 70 75 ~ 80 

Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 

85 90 95 

Val Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 110 

Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 

Cys Ser Pro Asn Gin Ser Leu Phe He Asn Val Thr Arg Val Ala Gin 
130 135 140 

Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 

165 170 175 

Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 

180 185 " ^ 190 

Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

Thr Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 

Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 

Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 

245 250 255 

Gin Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu 

260 265 - 270 



Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu He 
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275 280 285 



lie lie Leu Cys Ser Val Phe Ala Val Val Thr lie Leu Leu Val Gly 
290 295 300 



Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 



Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 

325 330 335 



Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 

340 345 350 



Leu Thr lie Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala 
355 360 365 



Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



Ala Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His 
385 390 " 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn 

405 .410 415 



Arg Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 



Ser Gly He Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin 
435 • 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn 
450 455 460 

He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr 
465 470 475 480 

Ser Leu Tyr Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 



Asn Arg Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin 

500 505 510 
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Thr Ser Gin Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys Met Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala lie Gly Gly Tyr Lys 
545 550 555 560 

Gly Lys Asp Tyr Ser Glu Ala Glu Ala Leu He Met Thr Phe Ser Leu 

565 570 575 

Asn Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp 

580 585 590 

Glu Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala 
595 600 605 



Gly Met Phe Gin Val Thr Phe Met Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



He Asn Arg Thr Thr Ala Glu Asp Leu Pro lie Phe Ala Thr Ser Tyr 
625 630 635 640 

* 

He Val He Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Ser 

645 650 ^ ^ 655 

Trp Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 

Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
675 680 685 

Tyr Leu Gly He Arg Ser Ser • Leu Val He Leu Gin Val Val Pro Phe 
690 695 700 

Leu Val Leu Ser Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His He 

725 730 735 



Gly Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 
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Ser Glu Ala lie Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 

Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val lie Leu Asp Phe 
770 775 780 

Leu Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 

Arg Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin 

805 810 815 

Glu Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe 

820 825 830 

Gin Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val 
835 840 845 

Val Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met 
850 855 860 

Cys His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 

Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 

885 890 895 

Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 

900 905 " 910 

Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 

Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 

Tyr Leu Ala lie Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 

965 970 975 

Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 

980 985 990 
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Cys Met Ser lie Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 

His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn lie Lys 
1010 1015 1020 

Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 

Thr Ser Asp Gly Gin Val Leu Asp Thr Val Ala lie Leu Ser Pro 
1040 1045 1050 

Arg Leu Glu Tyr Ser Gly Thr lie Ser Ala His Cys Asn Leu Tyr 
1055 1060 1065 

Leu Leu Asp Ser Ala Ser Arg Phe Met Ala Tyr His Lys Pro Leu 
1070 1075 " 1080 

Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala Ala Arg Glu 
1085 1090 1095 

Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val Pro Gly Thr 
1100 H05 ~ 1110 

Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr Asn Val Phe 
1115 H20 1125 

Tyr Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu Phe Met Leu 
1130 1135 1140 

Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys Leu Leu Leu 
1145 1150 H55 

Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu Ser He Val 
1160 1165 H70 

Met He Leu Val Asp Thr Val Gly Phe Met Ala Leu Trp Gly He 
1175 1180 H85 

Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser Ala Val Gly 
1190 1195 1200 



Met Ser Val Glu Phe Val Ser His He Thr Arg Ser Phe Ala He 
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1205 1210 1215 

Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu Ala Thr He 
1220 1225 1230 

Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met Thr Asn Leu 
1235 1240 1245 

Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin Leu He Gin 
1250 1255 1260 

He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu Leu Gly Leu 
1265 1270 1275 

Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser Tyr Val Gly 
1280 1285 1290 

Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys Arg Ala Glu 
1295 1300 1305 

Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro Asn His Pro 
1310 1315 1320 

Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn His Ser Phe 
1325 1330 1335 

Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn Phe Leu Pro 
1340 1345 1350 

Asn Asn Gly Arg Gin Phe 
1355 

<210> 45 

<211> 4471 

<212> DNA 

<213> Mus mus cuius 



<400> 45 

ggatcacttc ctggctctgg gatggcagct gcctggcagg gatggctgct ctgggccctg 60 

ctcctgaatt cggcccaggg tgagctctac acacccactc acaaagctgg cttctgcacc 120 

ttttatgaag agtgtgggaa gaacccagag ctttctggag gcctcacatc actatccaat 180 
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atctcctgct tgtctaatac cccagccccg ccatgtcaca ggtgaccacc tggctcttct 240 

ccagcgcgtc tgtccccgcc tatacaatgg ccccaatgac acctatgcct gttgctctac 300 

caagcagctg gtgtcattag acagtagcct gtctatcacc aaggccctcc ttacacgctg 360 

cccggcatgc tctgaaaatt ttgtgagcat acactgtcat aatacctgca gccctgacca 420 

gagcctcttc atcaatgtta ctcgcgtggt tcagcgggac cctggacagc ttcctgctgt 480 

ggtggcctat gaggcctttt atcaacgcag ttttgcagag aaggcctatg agtcctgtag 540 

ccgggtgcgc atccctgcag ctgcctcgct ggctgtgggc agcatgtgtg gagtgtatgg 600 

ctctgccctc tgcaatgctc agcgcctggc tcaacttcca aggagacaca gggaatggcc 660 

tggctccgct ggacatcacc ttccacctct tggagcctgg ccaggccctg gcagatggga 720 

tgaagccact ggatgggaag atcaaaccct gcaatgagtc ccagggtgaa gactcggcag 780 

cctgrttcctg ccaggactgt gcagcatcct gccctgtcat ccctccgccc ccggccctgc 840 

gcccttcttt ctacatgggt cgaatgccag gctggctggc tctcatcatc atcttcactg 900 

ctgtctttgt attgctctct gttgtccttg tgtatctccg agtggcttcc aacaggaaca 960 

agaacaagac agcaggctcc caggaagccc ccaacctccc tcgtaagcgc agattctcac 1020 

ctcacactgt ccttggccgg ttcttcgaga gctggggaac aatggtggcc tcatggccac 1080 

tcactgtctt ggcactgtcc ttcatagttg tgatagcctt gtcagtaggc ctgaccttta 1140 

tagaactcac cacagaccct gtggaactgt ggtcggcccc taaaagccaa gcccggaaag 1200 

aaaaggcttt ccatgacgag cattttggcc ccttcttccg aaccaaccag atttttgtga 1260 

cagctaagaa caggtccagc tacaagtacg actccctgct gctagggccc aagaacttca 1320 

gtgggatcct atccctggac ttgctgcagg agctgttgga gctacaggag agacttcgac 1380 

acctgcaagt gtggtcccat gaggcacagc gcaacatctc cctccaggac atctgctatg 144 0 

ctcccctcaa accgcataac accagcctca ctgactgctg tgtcaacagc ctccttcaat 1500 

acttccagaa caaccacaca ctcctgctgc tcacagccaa ccagactctg aatggccaga 1560 

cctccctggt ggactggaag gaccatttcc tctactgtgc caatgcccct ctcacgtaca 1620 

aagatggcac agccctggcc ctgagctgca tagctgacta cggggcgcct gtcttcccct 1680 

tccttgctgt tgggggctac caagggacgg actactcgga ggcagaagcc ctgatcataa 1740 

ccttctctat caat'aactac cccgctgatg atccccgcat ggcccacgcc aagctctggg 1800 

aggaggcttt cttgaaggaa atgcaatcct tccagagaag cacagctgac aagttccaga I860 

ttgcgttctc agctgagcgt tctctggagg acgagatcaa tcgcactacc atccaggacc 1920 

tgcctgtctt tgccatcagc taccttatcg tcttcctgta catctccctg gccctgggca 1980 
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gctactccag atggagccga gttgcggtgg attccaaggc tactctgggc ctaggtgggg 
tggctgttgt gctgggagca gtcgtggctg ccatgggctt ctactcctac ctgggtgtcc 
cctcctctct ggtcatcatt caagtggtac ctttcctggt gctggctgtg ggagctgaca 
acatcttcat ctttgttctt gagtaccaga ggctgcctag gatgcccggg gagcagcgag 
aggctcacat tggccgcacc ctgggtagtg tggcccccag catgctgctg tgcagcctct 
ctgaggccat ctgcttcttt ctaggggccc tgacctccat gccagctgtg aggacctttg 
ccttgacctc tggcttagca atcatctttg acttcctgct ccagatgaca gcctttgtgg 
ccctgctctc cctggatagc aagaggcagg aggcctctcg ccccgacgtc gtgtgctgct 
tttcaagccg 



gcaagatata 
ttctggtcct 
aggatctggc 



aaatctgccc ccaccgaaac aaaaagaagg cctcttactt tgcttcttcc 
cactcccttc ctgctgcaca gattcatccg ccctgttgtg ctgctgctct 
gtttggagca aacctctact taatgtgcaa catcagcgtg gggctggacc 
tctgcccaag gattcctacc tgatagacta cttcctcttt ctgaaccggt 



acttggaagt ggggcctcca gtgtactttg acaccacctc aggctacaac ttttccaccg 
aggcaggcat gaacgccatt tgctctagtg caggctgtga gagcttctcc ctaacccaga 
aaatccagta tgccagtgaa ttccctaatc agtcttatgt ggctattgct gcatcctcct 
gggtagatga cttcatcgac tggctgaccc catcctcctc ctgctgccgc atttataccc 
gtggccccca taaagatgag ttctgtccct caacggatac ttccttcaac tgtctcaaaa 
actgcatgaa ccgcactctg ggtcccgtga gacccacaac agaacagttt cataagtacc 
tgccctggtt cctgaatgat acgcccaaca tcagatgtct taaagggggc ctagcagcgt 
atagaacctc tgtgaatttg atctcagatg gccagattat agcctcccag ttcatggcct 

tctccgggca tcccggttgc 



accacaagcc 
tagcagccaa 
tcttccctta 



cttacggaac tcacaggact ttacagaagc 
catcacagct gaactacgga aggtgcctgg gacagatccc aactttgagg 
cacgatctcc aatgtgttct accagcaata 



cctgacggtt ctccctgagg 
gaatcttcac tcttgctctc tgcttcgtgc ccacctttgt ggtctgctac ctcctactgg 
gcctggacat acgctcaggc atcctcaacc tgctctccat cattatgatc ctcgtggaca 
ccatcggcct catggctgtg tggggtatca gctacaatgc tgtgtccctc atcaaccttg 
tcacggcagt gggcatgtct gtggagttcg tgtcccacat tacccggtcc tttgctgtaa 
gcaccaagcc tacccggctg gagagagcca aagatgctac tatcttcatg ggcagtgcgg 
tgtttgctgg agtggccatg accaacttcc cgggcatcct catcctgggc tttgctcagg 



2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 
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cccagcttat ccagattttc ttcttccgcc tcaacctcct gatcaccttg ctgggtctgc 3780 

tacacggcct ggtcttcctg cccgttgtcc tcagctatct ggggccagat gttaaccaag 3840 

ctctggtact ggaggagaaa ctagccactg aggcagccat ggtctcagag ccttcttgcc 3900 

cacagtaccc cttcccggct gatgcaaaca ccagtgacct atgttaacta aggctttaat 3960 

ccagaattta tccctgaaat taatgctgct agcagctctc tgcccaaaag tgaccaaaag 4020 

ttctaatgga gtaggagctt gtccaggctc catggttctt gctgataagg ggccacgagg 4080 

gtcttccctc tggttgtttc caaggcctgg ggaaagttgt tccagaaaaa aattgctggc 4140 

attcttgtcc tgaggcagcc agcactggcc actttgttgt cataggtccc cgaggccatg 4200 

atcagattac ctcctctgta aagagaatat cttgagtatt gtatgggatg tatcacatgt 4260 

caattaaaaa ggccatggcc tatggcttag gcaggaaata gggtgtggaa catccaggag 4320 

aagaaaggat tctgggataa aggacacttg ggaacgtgtg gcagtggtac ctgagcacag 4380 

gtaattagcc atgtggcgaa atgtagatta atataaatgc atatctaagt tatgattcta 4 440 

gtctagctat atggccaagg tatttataaa t 4 471 

<210> 46 
<211> 25 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 

<400> 46 

atgttaggtg agtctgaacc taccc 25 

<210> 47 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 
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<400> 47 

ggattgcatt tccttcaaga aagcc 25 

<210> 48 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 48 

tatggctctg ccctctgcaa tgctc 25 

<210> 49 

<211> 28 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 

<400> 49 

tcagcagcct ctgttccaca tacacttc 28 

<210> 50 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 
<400> 50 

gttccacagg gtctgtggtg agttc 25 
<210> 51 
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<211> 3996 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (1)..(3996) 

<223> n is g or a or t or c 



<400> 51 

atggcngarg cnggnytnmg nggntggytn ytntgggcny tnytnytnmg nytngcncar 60 

wsngarccnt ayacnacnat hcaycarccn ggntaytgyg cnttytayga ygartgyggn 120 * 

aaraayccng arytnwsngg nwsnytnatg acnytnwsna aygtnwsntg yytnwsnaay 180 

acnccngcnm gnaarathac nggngaycay ytnathytny tncaraarat htgyccnmgn 240 

ytntayacng gnccnaayac ncargcntgy tgywsngcna arcarytngt nwsnytngar 300 

gcnwsnytnw snathacnaa rgcnytnytn acnmgntgyc cngcntgyws ngayaaytty 360 

gtnaayytnc aytgycayaa yacntgywsn ccnaaycarw snytnttyat haaygtnacn 420 

mgngtngcnc arytnggngc nggncarytn ccngcngtng tngcntayga rgcnttytay 480 

carcaywsnt tygcngarca rwsntaygay wsntgywsnm gngtnmgngt nccngcngcn 540 

gcnacnytng cngtnggnac natgtgyggn gtntayggnw sngcnytntg yaaygcncar 600 

mgntggytna ayttycargg ngayacnggn aayggnytng cnccnytnga yathacrvtty 660 

cayytnytng arccnggnca rgcngtnggn wsnggnathc arccnytnaa ygarggngtn 720 

gcnmgntgya aygarwsnca rggngaygay gtngcnacnt gywsntgyca rgaytgygcn 780 

gcnwsntgyc cngcnathgc nmgnccncar gcnytngayw snacnttyta yytnggncar 840 

atgccnggnw snytngtnyt nathathath ytntgywsng tnttygcngt ngtnacnath 900 

ytnytngtng gnttymgngt ngcnccngcn mgngayaarw snaaratggt ngayccnaar 960 

aarggnacnw snytnwsnga yaarytnwsn ttywsnacnc ayacnytnyt nggncartty 1020 

ttycarggnt ggggnacntg ggtngcnwsn tggccnytna cnathytngt nytnwsngtn 1080 

athccngtng tngcnytngc ngcnggnytn gtnttyacng arytnacnac ngayccngtn 1140 

garytntggw sngcnccnaa ywsncargcn mgnwsngara argcnttyca ygaycarcay 1200 
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ttyggnccnt tyttymgnac naaycargtn athytnacng cnccnaaymg nwsnwsntay 1260 

mgntaygayw snytnytnyt nggnccnaar aayttywsng gnathytnga yytngayytn 1320 

ytnytngary tnytngaryt ncargarmgn ytnmgncayy tncargtntg gwsnccngar 1380 

gcncarmgna ayathwsnyt ncargayath tgytaygcnc cnytnaaycc ngayaayacn 14 4 0 

wsnytntayg aytgytgyat haaywsnytn ytncartayt tycaraayaa ymgnacnytn 1500 

ytnytnytna cngcnaayca racnytnatg ggncaracnw sncargtnga ytggaargay 1560 

cayttyytnt aytgygcnaa ygcnccnytn acnttyaarg ayggnacngc nytrxgcnytn 1620 

wsntgyatgg cngaytaygg ngcnccngtn ttyccnttyy tngcnathgg nggntayaar 1680 

ggnaargayt aywsngargc ngargcnytn athatgacnt tywsnytnaa yaaytayccn 174 0 

gcnggngayc cnmgnytngc ncargcnaar ytntgggarg argcnttyyt ngargaratg 1800 

mgngcnttyc armgnmgnat ggcnggnatg ttycargtna cnttyacngc ngarmgnwsn 1860 

ytngargayg arathaaymg nacnacngcn gargayytnc cnathttygc nacnwsntay 1920 

athgtnatht tyytntayat hwsnytngcn ytnggnwsnt aywsnwsntg gwsmngngtn 1980 

atggtngayw snaargcnac nytnggnytn ggnggngtng cngtngtnyt nggngcngtn 2040 

atggcngcna tgggnttytt ywsntayytn ggnathmgnw snwsnytngt nathytncar 2100 

gtngtnccnt tyytngtnyt nwsngtnggn gcngayaaya thttyathtt ygtnytngar 2160 

taycanngny tnccnmgnmg nccnggngar ccnmgngarg tncayathgg nmgngcnytn 2220 

ggnmgngtng cnccnwsnat gytnytntgy wsnytnwsng argcnathtg yttyttyytn 2280 

ggngcnytna cnccnatgcc ngcngtnmgn acnttygcny tnacnwsngg nytncjcngtn 2340 

athytngayt tyytnytnca ratgwsngcn ttygtngcny tnytnwsnyt ngaywsnaar 2400 

mgncargarg cnwsnmgnyt ngaygtntgy tgytgygtna arccncarga rytnccnccn 24 60 

ccnggncarg gngarggnyt nytnytnggn ttyttycara argcntaygc nccnfctyytn 2520 

ytncaytgga thacnmgngg ngtngtnytn ytnytnttyy tngcnytntt yggngtnwsn 2580 

ytntaywsna tgtgycayat hwsngtnggn ytngaycarg arytngcnyt nccnaargay 2640 

wsntayytny tngaytaytt yytnttyytn aaymgntayt tygargtngg ngcnccngtn 2700 

tayttygtna cnacnytngg ntayaaytty wsnwsngarg cnggnatgaa ygcnathtgy 2760 

wsnwsngcng gntgyaayaa yttywsntty acncaraara thcartaygc nacngartty 2820 

ccngarcarw sntayytngc nathccngcn wsnwsntggg tngaygaytt yathgaytgg 2880 

ytnacnccnw snwsntgytg ymgnytntay athwsnggnc cnaayaarga yaarttytgy 294 0 
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ccnwsnacng tnaaywsnyt naaytgyytn aaraaytgya tgwsnathac natgggnwsn 3000 

gtnmgnccnw sngtngarca rttycayaar tayytnccnt ggttyytnaa ygaymgnccn 3060 

aayathaart gyccnaargg nggnytngcn gcntaywsna cnwsngtnaa yytnacnwsn 3120 

gayggncarg tnytngcnws nmgnttyatg gcntaycaya arccnytnaa raaywsncar 3180 

gaytayacng argcnytnmg ngcngcnmgn garytngcng cnaayathac ngcngayytn 3240 

mgnaargtnc cnggnacnga yccngcntty gargtnttyc cntayacnat hacnaaygtn 3300 

ttytaygarc artayytnac nathytnccn garggnytnt tyatgytnws nytntgyytn 3360 

gtnccnacnt tygcngtnws ntgyytnytn ytnggnytng ayytnmgnws nggnytnytn 34 20 

aayytnytnw snathgtnat gathytngtn gayacngtng gnttyatggc nytntgggay 3480 

athwsntaya aygcngtnws nytnathaay ytngtnwsng cngtnggnat gwsngtngar 3540 

ttygtnwsnc ayathacnmg nwsnttygcn athwsnacna arccnacntg gytnganngn 3600 

gcnaargarg cnacnathws natgggnwsn gcngtnttyg cnggngtngc natgacnaay 3660 

ytnccnggna thytngtnyt nggnytngcn aargcncary tnathcarat httyttytty 3720 

mgnytnaayy tnytnathac nytnytnggn ytnytncayg gnytngtntt yytnccngtn 3780 

athytnwsnt aygtnggncc ngaygtnaay ccngcnytng cnytngarca raarmgngcn 384 0 

gargargcng tngcngcngt natggtngcn wsntgyccna aycayccnws nmgngtnwsn 3900 

acngcngaya ayathtaygt naaycaywsn ttygarggnw snathaargg ngcnggngcn 3960 

athwsnaayt tyytnccnaa yaayggnmgn cartty 3996 
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